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Abstract: The problem of test of fit for Vector AutoRegressive (VAR) processes 
with unconditionally heteroscedastic errors is studied. The volatility structure is 
deterministic but time-varying and allows for changes that are commonly observed 
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' in economic or financial multivariate series such as breaks or smooth transitions. 

m 



Our analysis is based on the residual autocovariances and autocorrelations obtained 
from Ordinary Least Squares (OLS), Generalized Least Squares (GLS) and Adap- 
tive Least Squares (ALS) estimation of the autoregressive parameters. The OLS 
residuals are the standards estimates of the VAR model errors. To build the GLS 
residuals we use the GLS estimate of the VAR coefficients to estimate the model 
errors that we further standardize by the time-varying volatility. Hence, the GLS 
estimates require the knowledge of the variance structure. The ALS approach is the 
GLS approach adapted to the unknown time- varying volatility that is then estimated 
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by kernel smoothing. The properties of the three types of residual autocovariances 
and autocorrelations are derived. In particular it is shown that the ALS and GLS 
residual autocorrelations are asymptotically equivalent. It is also found that the 
asymptotic distribution of the OLS residual autocorrelations can be quite different 
from the standard chi-square asymptotic distribution obtained in a correctly speci- 
fied VAR model with iid innovations. As a consequence the standard portmanteau 
tests are unreliable in our framework. The correct critical values of the standard 
portmanteau tests based on the OLS residuals are derived. Moreover, modified 
portmanteau statistics based on ALS residual autocorrelations are introduced and 
their asymptotic critical values are obtained. The finite sample properties of the 
goodness-of-fit tests we consider are investigated by Monte Carlo experiments. The 
theoretical results are also illustrated using a U.S. economic data set. 

Keywords: VAR model; Unconditionally heteroscedastic errors; Residual autocorre- 
lations; Portmanteau tests. 
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1 Introduction 



In the econometric analysis numerous tools are routinely used in the framework 
of VAR (Vector AutoRegressive) modeling of time series variables (see Lutkepohl 
(2005) and references therein). Nevertheless it is well known that these tools are 
in general noticeably affected by the adjusted autoregressive order. For instance 
Thornton and Batten (1985), Stock and Watson (1989) or Jones (1989) discussed 
the importance of well specified VAR model for the test of linear Granger causality 
in mean. Therefore the checking of goodness-of-fit of the autoregressive order is com- 
monly performed in applied works before proceeding to the analysis of the dynamics 
of time series. The dominant tests for the adequacy of the autoregressive order 
are the portmanteau tests introduced in the VAR framework by Chitturi (1974) 
and Hosking (1980). The properties of the tests based on the residual autocorrela- 
tions are well explored in the case of stationary processes (see e.g. Francq, Roy and 
Zakoi'an (2005) in the univariate case, Francq and Rai'ssi (2007) or Boubacar Mainas- 
sara (2010) in the multivariate case). Duchesne (2005), Briiggemann, Lutkepohl and 
Saikkonen (2006) and Rai'ssi (2010) developed tests for residual autocorrelation in 
a cointegrated framework with stationary innovations. 

However many applied studies pointed out the presence of non stationary volatil- 
ity in economic time series. For instance Ramey and Vine (2006) found a declining 
volatility in the U.S. automobile industry. Watson (1999) noted a declining volatility 
of short-term U.S. interest rates and increasing volatility for long-term U.S. interest 
rates. Sensier and van Dijk (2004) considered 214 U.S. macroeconomic variables and 
found that approximately 80% of these variables have a volatility that changes in 
time. These findings stimulated an interest on the effects of non-stationary volatil- 
ity in time series analysis amongst econometricians (see e.g. Kim, Leybourne and 
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Newbold (2002) or Cavaliere and Taylor (2007)). 

The present paper is motivated by the need of reliable tools for testing the 
adequacy of the autoregressive order of VAR models with non stationary volatility. 
On one hand, we show that in such cases the use of standard procedures for testing 
the adequacy of the autoregressive order can be quite misleading. On the other hand, 
valid portmanteau tests based on Ordinary Least Squares (OLS) and Adaptive Least 
Squares (ALS) residual autocovariances are proposed for testing the goodness-of-fit 
tests of non-stationary but stable VAR processes. More precisely we consider the 
VAR model of order p > and dimension d > 1 

X t = A 01 X^. 1 + --- + A Qp X t -p + Ut, (1-1) 
u t = H t e t , * = 1,2, ... 

where X t are random vectors of dimension d and the d x d— matrices Aoi, i G 
{1, . . . ,p}, are such that the process (X t ) is stable, that means det(A(z)) ^ for all 
\z\ < 1, with A(z) = Id — Y^h=i A§iZ l . Here, H t is an unknown d x d matrix-valued 
deterministic function of time and (e t ) is an innovation process of unit variance that 
could be serially dependent. Phillips and Xu (2005), Xu and Phillips (2008) already 
studied the problem of estimation of such univariate stable autoregressive processes. 
Patilea and Rai'ssi (2010) investigated the estimation and the test of parameter 
restrictions of multivariate stable autoregressive processes like in (II. ip . 

The usual way to check the adequacy of a stable VAR(p) model, implemented 
in any specialized software, is to assume that the error term u t is second order 
stationary, to fix an integer m > and to test 

U : Cov(u t , u t _ h ) = 0, for all < h < m, (1.2) 

using a classical (Box-Pierce or Ljung-Box) portmanteau test statistic and chi-square 
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type critical values. The errors Ut are approximated using the OLS type estimates 
of the coefficients A^. With the volatility structure we assumed in model (11.11) . 
the variance of u t depends on t and the usual chi-square type critical values are in 
general inaccurate, that is the asymptotic distribution of the classical portmanteau 
statistics under %q is no longer of chi-square type. 

Here we propose two ways to correct this problem. First, we derive the correct 
asymptotic distribution of the classical portmanteau test statistics under "Ho and 
the conditions of model (II. ip . This asymptotic distribution is a weighted sum of 
d 2 m independent chi-square distributions. Next, we indicate how the correct critical 
values can be approximated. 

To explain our second approach, let us notice that Cov(u t , u t ~h) = is equivalent 
to Cov(e t , e t -h) = and the variance of e t does not depend on the time t. Thus an 
alternative idea for checking the adequacy of a model like (II. ip is to test 

n' : Cov(ei, e t _ h ) = 0, for all < h < m. (1.3) 

The values et are approximated by residuals built using a nonparametric estimate of 
the deterministic function H t and Adaptive Least Squares (ALS) type estimates of 
the coefficients Aqi that take into account the volatility structure. More precisely, to 
build the ALS residual vector at time t we use the ALS estimate of the VAR coeffi- 
cients to estimate the VAR model error vector at time t that we further standardize 
by the nonparametric estimate of time- varying volatility H t . Next, we build classical 
portmanteau test statistics using the estimates of e t and we derive the asymptotic 
distribution under Hq. The asymptotic distribution is again a weighted sum of d 2 m 
independent chi-square distributions and the weights can be easily estimated from 
the data. In some important particular cases, including the univariate (i.e. d = 1) 
autoregressive models, we retrieve the standard chi-squared asymptotic distribution. 
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The remainder of the paper is organized as follows. In section [2] we specify 
the framework of our study and state the asymptotic behavior of the OLS and the 
Generalized Least Squares (GLS) estimators of the VAR coefficients. The asymp- 
totic normality of the OLS and the infeasible GLS residual autocovariances and 
autocorrelations is established in section [3j The GLS residuals are defined as the 
standardized (by the true volatility H t ) estimates of the model error vector obtained 
with the GLS estimates of the VAR coefficients. In section H] we highlight the unreli- 
ability of the chi-square type critical values for standard portmanteau statistics and 
we derive their correct critical values in our framework. In section the ALS resid- 
ual autocovariances and autocorrelations are introduced. Since the GLS residual 
autocovariances and autocorrelations are infeasible, we investigate the relationship 
between the GLS and ALS residual autocovariances and autocorrelations and we 
show that, in some sense, they are asymptotically equivalent. This result is used to 
introduce portmanteau tests based on the ALS residuals that have the same critical 
values like those based on the infeasible GLS residuals. In section Owe propose suit- 
ably modified quadratic forms of OLS and ALS residual autocovariances in order to 
obtain alternative test statistics with chi-square asymptotic distributions under the 
null hypothesis. Such modified statistics are nothing but Wald type test statistics 
for testing the nullity of a vector of autocovariances. In section [7] some theoret- 
ical comparisons of the asymptotic power, in the Bahadur sense, are carried out: 
classical Box-Pierce portmanteau test vs. modified quadratic forms of OLS resid- 
ual autocorrelations based test; and ALS vs. OLS residual autocorrelations based 
portmanteau tests. A possible extension of our findings on testing the order of a 
VAR model to the case of heteroscedastic co-integrated variables is briefly described 
in section |HJ The finite sample properties of the different tests considered in this 
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paper are studied by mean of Monte Carlo experiments in section |9j In section [10] 
applications to U.S. economic real data sets are used to illustrate the theoretical 
results: the U.S. balance on services and balance on merchandise trade data, and 
the U.S. energy-transport consumer price indexes. The summary of our finding and 
some concluding remarks are given in section [TTJ The proofs and the tables and 
figures are relegated in the appendices. 

2 Parameters estimation 

p 

In the following weak convergence is denoted by =>■ while — > stands for convergence 
in probability. The symbol (g) denotes the usual Kronecker product for matrices 
and A® 2 stands for A® A. The symbol vec(-) is used for the column vectorization 
operator. We denote by [a] the integer part of a real number a. For a squared matrix 
A, tr(A) denotes the trace. For a random variable x we define || x \\ r — (E \\ x || r ) 1 ^ r , 
where || x || denotes the Euclidean norm. We also define the a— field jF t — cr(e s : s < 
t). The following conditions on the innovations process (u t ) are assumed to hold. 

Assumption Al: (i) The d x d matrices H t are positive definite and the com- 
ponents {gij(r) : 1 < k, I < d} of the matrix G(r) are measurable determinis- 
tic functions on the interval (0,1], such that H t = G(t/T) and, VI < k,l < d, 
sup re( - x j \gk,i{ r )\ < oo and gk,i(-) satisfies a Lipschitz condition piecewise on a finite 
number of some sub-intervals that partition (0, 1] (the partition may depend on k, I). 
The matrix E(r) = G(r)G(r)' is assumed positive definite for all r. 
(ii) The process (e t ) is a-mixing and such that E(e t | Ft-i) = 0, E(e t e' t | jFt-i) = Id 
and sup t || en ||4/i< oo for some /i > 1 and all i £ {1, ... , d}. 
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The second approach we propose for checking the adequacy of a VAR(p) model 
requires the estimation of the innovations 6t, and hence we will need an identification 
condition for G(r) and an estimate of the matrix H t . The condition H t is positive 
definite matrix identifies G(r) as the square root of S(r) and this is a convenient 
choice for the mathematical proofs. Nevertheless one can notice from the following 
that our results could be stated using alternative conditions, like for instance H t 
is a lower triangular matrix with diagonal components restricted to be positive. 
The conditions on the unknown volatility function G(r) are general and allow for 
a large set of dynamics for the innovation variance as for instance abrupt shifts or 
piecewise afline functions. This assumption generalizes to a multivariate framework 
the specification of the innovation variance considered in Xu and Phillips (2008). 
The conditional homoscedasticity of (e$) imposed in (ii) ensures the identifiability 
of £(r). We call a model like in ( II. ip with the innovation process (u t ) satisfying 
Assumption Al a stable VAR(p) model with time-varying variance. 

To introduce the OLS and GLS estimators of the autoregressive parameters, set 
the observations A_ p+1 , . . . ,X equal to the null vector of M. d (or any other initial 
values) and denote by 6*0 = (vec (Aqi)' . . . vec (A Qp )'y G W jd2 the vector of true 
parameters. The equation (II. ip becomes 



X t = (X' t _ 1 ®Id)0o + ut, * = 1,2, ... 



(2.1) 



Ut = H t e t , 



withX t -. 1 = (X' t _ 1 ,... 1 Xl_ p 



' . Then the OLS estimator is 
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where 

T T 

= T- 1 ^X t _iX t / _ 1 ® J„ and Ex = T~ x ^ X t Xt_ v 
t=i t=\ 

Multiplying by Hf 1 on the left in equation ( 12. ip we obtain 

iff 1 ** = AT 1 ^-! ® Wo + e t , 
and then the GLS estimator is 

Ogls = E^vec (%) , (2.2) 

with 

T T 

t=i t=i 
In general, the GLS estimator is infeasible since it involves the true volatility matrix. 

Due to the stability condition, we can write X t = X^i=o^ u *-*' where ipo = Id 
and the components of the ipi's are absolutely summable d x d— matrices. Then 



oo 

,p 



where u p t is given by u p t = l p £§) u t , l p is the vector of ones of dimension p, and 

i>i = diag{ipi, • • • , ipi- P +i}, 

taking ijjj = for j < 0. 

Let lp Xp stand for the p x p— matrix with all components equal to one. Patilea 
and Rai'ssi (2010) proved that under Al 

tH§ gls - ^^^Ar 1 ), (2.3) 

where 

r 1 °° r i 

Ai = J ^{$i{lpxp®X{r))& i \®E(r)- 1 dr, 



i=0 



and 

tH^ols -do) => A/^Ag^Ag 1 ), (2.4) 

with 

A 2 = / V R(l pxp ®£(0)$ ®£(r)dr, 
- 70 i=0 

A 3 = / ^|^(l pxp ® S(r))^|dr®/ d . 
^° i=0 

Moreover, they showed that A3 *A2A3 1 — A^ 1 is positive semi-definite. 

3 Asymptotic behavior of the residual autocovari- 
ances 

Let us define the OLS-based estimates of u t and the GLS-based estimates of e t 

u t = X t -{X' t _ x ®I d )6 LS and e t = H^X t - H^{X' t _ x ® I d ) § GLS . 
The corresponding residual autocovariances are defined as 

T T 

r u oLs(h) = T~ l utu't-h and r GLS (h) = T" 1 £ e t e' t _ h - 

t=h+l t=h+l 

In general the estimated residuals e t as well as the autocovariances t'cLsW are 
not computable since they depend on the unknown matrices H t and the infeasible 
estimator Ogls- 

For any fixed integer m > 1, the estimates of the first m residual autocovariances 
are defined by 

7^° iS = vec{(f u OLS (l), . . . , T OLS (m))) , 7^ GL5 = vec{(f^ 5 (l), . . . , T GLS (m))} . 
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To state the asymptotic behavior of 7„ OLS and 7^, let us define 

Aqi . . . Aop_i Aq p 



K 











V 



/ 



Note that if u t = (u' t , . . . , 0)', X t = KX t -i + u t . Now, let S G = JjJ E(r)dr, S 
/o £(r)® 2 dr and 

m— 1 
i=0 

m-l /- »i n 

A - = S + 1 ) e ^ 1 )' ® / G W ® ^W" 1 ^ ® , (3-3) 

(3.4) 



m— 1 



(3-1) 
(3-2) 



where e m (j) is the vector of dimension m such that the jth component is equal to 
one and zero elsewhere 

Proposition 1 If model U.l\) is correct and Assumption Al holds true, we have 



Ti^ OLS JV(0, £ 



u,OLS\ 



(3.5) 



where 



T^ Gi5 ^A/"(0,£ e ' GiS ), 



where 



,GLS 



I d 2 m - A^ A : . 



In the particular case p = 0, J] u ' OLS = A^ u and Y> e ' GLS = I, 



d 2 , 



1 Recall that our identification condition for Ht implies G(r) = S(r) 1 / 2 . 



(3.6) 
(3.7) 

(3.8) 
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Let us discuss the conclusions of Proposition [T] in some particular situations. In 
the case where £(■) = a 2 (-)Id for some positive scalar function &(■), we have 



m— 1 



i=0 



i=0 



so that in this case the asymptotic distribution of the e t autocovariances estimates 
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e,GLS 



do not depend on the volatility function £(•). Meanwhile, the (asymptotic) 



covariance matrix S"' OLS still depends on the volatility function. 

If we suppose that (u t ) have a time-constant variance E(r) = E u , we obtain 



Ax =E 



€ t X t 



where E u = G U G' U , and 



m 



where 



A 2 = E 



luj,...,uj_ m )' and e 



(e t , . . . , e t _ m 



l'. By straightforward computa- 



tions 



E u ' OLS = I m <g> E® 2 - E 



E 



x t x' t 



E 



ufX' 



^e,GLS 



I<Pm — E 



e t A t 



E 



XtK 



E 



6 i A i 



(3.9) 
(3.10) 



Formula (13. 9p (resp. (I3.10P ) corresponds to the (asymptotic) covariance matrix 
obtained in the standard case with an i.i.d. error process of variance E n (resp. Id), 
see Liitkepohl (2005), Proposition 4.5. Herein, some dependence of the error process 
is allowed. In particular, equation ( 13. 1 Qj) indicates that the homoscedastic (time- 
constant variance) case is another situation where Yl e ' GLS does not depend on error 
process variance E u . 

Proposition [1] shows that in general VAR models with time-varying variance 
the covariance matrix S e,GLS depends on £(•). For the sake of simpler notation, 
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hereafter we write ToLs{h) (resp. 7^ L5 ) (resp. S 01 " 5 ) instead of TqlsW ( res P- 
lm° LS ) ( res P- S"' OL5 ). Similar notation simplification will be applied for T e ,GLs{h), 
7^ GL5 and £ £ > GiS . 

The following example shows that when the error process is heteroscedastic, 
the covariance matrices YP LS and YF LS can be quite different and far from the 
covariance matrices obtained in the stationary case. 



Example 3.1 Consider a bivariate AR(1) model X t = AoX t -i + Ut with true pa- 
rameter Aq equal to the zero 2 x 2— matrix. One can use such a model to study linear 
Granger causality in mean between uncorrelated variables. However in practice one 
has first to check that the error process is a white noise. If we assume for simplicity 
that 

' Mr) 
Mr) J 

we obtain diagonal covariance matrices E OLS = diag{04 X ^, 7 m _i®S OL5 } and YF LS 
diag{T, GLS ,I i{m _ 1} }, with 



E(r) 



V 



\ 



'^OLS 



f JlMrfdr 

tiMr)Mr)dr 
















JoMr)M r )d7 










JoMrfdr ) 



and 



/ 



^GLS 





1 









(J 1 E 1 (r)2S 2 (r)-5dr) 2 
/^SiWEaCr)-!* 
















/ 
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We denote by qxq the null matrix of dimension q x q. Note that the matrix 
I&(m-i) which appears in the expression of TP LS is a consequence of the assump- 
tion Aq = 02x2- If we suppose that the errors are homoscedastic, that is S(r) 
is constant, equal to some £„, we obtain Yf LS = diag^^x^, I m -i <8> S® 2 } and 
YpLS _ diag^^xA, I^m-i)}- Therefore in the OLS approach and if the innova- 
tions variance is spuriously assumed constant, the asymptotic spurious covariance 
matrix Eg LS = diag{§i X ^ I m -i <E> s} is used with 

' SlMr)dr 



■>u,S 



\ /oS 2 (r)dr J 

Now we illustrate the difference between the covariance matrices obtained if we take 
into account the unconditional heteroscedasticity of the process and the case where 
the process is spuriously supposed homoscedastic. We take 

Mr) = 4> + K 2 i - 4)) x l { r>n}(r) (3.11) 

and 

S 2 (r) = a 2 2 + (a 2 21 - a 2 20 ) x l {r > T2 }(r), (3.12) 

where r, G [0,1] with z G {1,2}. This specification of the volatility function is 
inspired by Example 1 of Xu and Phillips (2008) (see also Cavaliere (2004)). In 
Figure (TJ we take T\ = r 2 , af = cr| = 1 and a\ x = 0.5, so that only the break dates 
and a\\ vary freely. In figure [2] only the break dates vary with T\ ^ r 2 in general, 
and <t 2 = cx| = 1, o\ x = a\ x = 4. In Figure [1] and [2] we plot in the left graphics the 
second component E G (2, 2) on the diagonal of YF LS and in the right graphics the 
ratio of S° i5 (6, 6) /Sg L5 (6, 6). 

From the left graphic of Figure [1] it turns out that E GLS (2, 2) could be far 
from zero for larger values of er 2 i and when the breaking point T\ is located early 
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in the sample. From the right graphic of Figure [T] we can see that the ratio of 
E OL5 (6, 6)/E^ L (6, 6) can be far from 1, however the relation between this ratio 
and the variations of n, o~2i is n °t clear. From the left graphic of Figure [2j it 
appears that T, GLS (2,2) can be far from zero. According to the right graphic of 
Figure [21 the relative difference between E OI,5 (6,6) and T,°, LS (6,G) is significantly 
larger when the breaking points t\ and T2 are located at the end of the sample. This 
example shows that the standard results for the analysis of the autocovariances can 
be quite misleading when the unconditional homoscedasticity assumption on the 
innovations process does not hold. 

We also consider the vector of residual autocorrelations: for a given integer 
m > 1, define 

pZ LS = vec { (iW(l), • • • , RoLs{m)) } where R OLS (h) = S^TolsWS- 1 

with Si = Diag{^(l), . . . , ^(d)}, d%i) = T -1 £f =1 &%, and 

Pa,i S = vec { (RglsW, • • • , R G Ls(m)) } where R GLS (h) = ^^^(/i)^" 1 , 

with = Diag{of(l),..., df(cf)}, alii) = T~ l J2t=i <Ht- Since e t has identity 
variance matrix, we can also define 

3 gls = ~GLS 

rb,m Im 

Proposition 2 If model U.l\) is correct and Assumption Al holds true, we have 

r^^^^O, (3.13) 

where 

V OLS = {I m ® (S u <g> 5 u )^}S OL5 {/ m <g> (S u <g> S^- 1 }, 
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where S% = Diag{Ea,xi, • • • , ^G,dd}- Moreover, 

T l 2p° LS =^A/"(0,£ GLS ), (3.14) 

where p^ LS stands for any of p G ^f or pf^f. 

Using Proposition [2} S u and a consistent estimator of S OL5 (that can build in a 
similar way to that of A GLS , see Section HI p. [18"]) . one can easily build a consistent 
estimate of v]/ 01,5 and confidence intervals for the OLS residual autocorrelations. 



4 Modified portmanteau tests based on OLS esti- 
mation 

Corrected portmanteau tests based on the OLS residual autocorrelations are pro- 
posed below. We use the standard Box-Pierce statistic, Box and Pierce (1970), 
introduced in the VAR framework by Chitturi (1974) 

m 

Q° m LS = r^tr(f' Oi5 (/ i )foi 5 (0)f Oi5 (/ i )foi 5 (0)) 

h=l 

= Tl° m LS ' (irn ® f ols(0) ® f o L(0)) lZ LS - (4-1) 

We also consider the Ljung-Box statistic (Ljung and Box (1978)) introduced in the 
VAR framework by Hosking (1980) 

m 

Q° m LS = T 2 E( T -^" ltr ( f, °^ (/l)f ^ (0)foLs(/l)f ^ (0) )- 

h=l 

The following result, a direct consequence of Proposition [1] equation ( 13. 1 3j) . provides 
the asymptotic distribution of Q^ LS and Q^ LS - 
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Theorem 4.1 If model U.l\) is correct and Assumption Al holds true, the statistics 
Qm an d Qm LS converge in law to 

U(5Z LS )=J2 5 ° lSU ?> ( 42 ) 
i=i 

as T — >■ oo, where <5^ L5 = (5f s , . . . , S^f )' is the vector of the eigenvalues of the 
matrix 

A° L5 = (I m ® S G 1/2 ® S G 1/2 )S°^(J m (8 S" 1 / 2 ® S" 1 / 2 ), 
= Jq 1 E(r)dr and ine [/, 's are independent jV(0, 1) variables. 

When the error process is homoscedastic i.i.d. and m is large, it is well known 
that the asymptotic distribution of the statistics Q^ LS an d Qm under the null 
hypothesis can be approximated by a chi-square law with d 2 (m — p) degrees 
of freedom, see Box and Pierce (1970). In our framework, even for large m, the 
limit distribution in (14.21) can be very different from a chi-square law. The following 
example illustrate this point. 

Example 4.1 Consider the bivariate process in Example 13.11 Then 

AZ LS = diag{0 4x4 , J m _! ® A OLS } 



with 



Aqls — 



( Q \ 

(Jo Si(r)dr) 2 

q Jo Si(r)S 2 (r)dr q q 







Jo S 2 (r) 2 dr 

If we suppose that S(r) is constant and A = 02x2; we obtain A^ls = I4, so that 
the asymptotic distribution of and Q^ 13 is x 2 (d 2 (m — p)) with p = 1 and 
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d — 2. However it is easy to see that the d 2 (m — p) non zero diagonal elements in 
A^ LS can be far from one if the error process is heteroscedastic. From the Jensen 
inequality the components Aols^I, 1) and Aols(4, 4) are greater or equal than one. 
For illustration, in the right graphics of Figures [1] and [2] we present the second 
diagonal element of Aols when the volatility function is like in (13. 11 1) - (13. 121) . 

Estimates of the weights which appear in (14. 2 p can be obtained as follows. First, 
let us recall the following results proved by Patilea and Rai'ssi (2010): 

T 

E G .g.2 := T _1 u t ^ 1 u' t ^ 1 (g) Utu't = S G ®2 + o p (l), (4.3) 

t=2 

T 

± G := T" 1 utu't = S G + 0p (l), (4.4) 
t=i 

T 

A 2 := T" 1 ^X^XU <g> u$ = A 2 + Op(l), (4.5) 
t=i 

and 

A 3 :=S^ = A 3 + o p (l). (4.6) 

A consistent estimator of $^ and A^f given in (13. ip and (13. 2p is easily obtained by re- 
placing Aqi, . . . , Aop with their OLS estimators in K and using ( 14. 3 j) and ( 14. 4p . Thus 
from this and the equations (14. 3 p to (14. 6p . one can easily define a consistent estimator 
of A^ L5 . Denote the estimated eigenvalues of A° LS by = (6f s , . . . , f^m)'- 

We are now ready to introduce the OLS residuals-based corrected versions of 
the Box-Pierce (resp. Ljung-Box) portmanteau tests for testing the order of the 
VAR model (II. ip . With at hand a vector ^ LS , at the asymptotic level a, the Box- 
Pierce (resp. Ljung-Box) procedure consists in rejecting the null hypothesis (ll.2p of 
uncorrelated innovations when 

P(Q° LS > Uols(SZ LS ) \X u ...,X T )<a 
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(resp. P(Qm LS ' > UoLs(&m LS ) I -^i> • • • j-^t) < oi). The p-values can be evaluated 
using the Imhof algorithm (Imhof, 1961) or the saddle point method, see e.g. Kuonen 
(1999). 

Let us end this section with some remarks on the particular case E(-) = a 2 (-)Id 
(that includes the univariate AR(p) models with time- varying variance). In this 

case 





-2 




-2 




/ a 2 (r)dr 


Yf>LS _ 


I a 2 {r)dr 




/ a 4 (r)dr 


Jo 




Jo 




Jo 



(4.7) 

and clearly, c a > 1. If in addition p = 0, by Proposition [1] we have £ GL5 = I d 2 m and 
hence 8g LS = c a (l, ■ • • ,1)'. 



5 Adaptive portmanteau tests 

An alternative way to build portmanteau tests for VAR(p) models with time- varying 
variance we consider herein is to use approximations of the innovation A nonpara- 
metric estimate of the volatility function is needed for building such approximations. 
For this purpose we generalize the approach of Xu and Phillips (2008) to the mul- 
tivariate case, see also Patilea and Rai'ssi (2010). Let us denote by A B the 
Hadamard (entry wise) product of two matrices of same dimension A and B. Define 
the symmetric matrix 

T 

S° = Uiu'i, 

1=1 

where, as before the u^s are the OLS residuals and the kl— element, k < I, of the 
d x d matrix of weights wu is given by 

w t i{bki) = ^y]K t i{b k i)^ K u (b k i), 
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with bki the bandwidth and 

K(h . f K{W if ^ 
Kuipu) = < 

if t = i. 

The kernel function If (2) is bounded nonnegative and such that J^° Qo K(z)dz = 1. 
For all 1 < A; < I < d the bandwidth bki belongs to a range Bt = [c m i n bT, Cmaxbr] 
with c m i n , c m ax > some constants and bx 4 at a suitable rate that will be specified 
below. 

When using the same bandwidth bki G -Bt for all the cells of E°, since u i; 
i = 1, ...,T are almost sure linear independent each other, E° is almost sure positive 
definite provided T is sufficiently large. When using several bandwidths bki a regu- 
larization of S° could be necessary in order to ensure positive definiteness. Let us 
consider 

t t = {{t») 2 + v T I d } 1 ' 2 

where Vt > 0, T > 1, is a sequence of real numbers decreasing to zero at a suit- 
able rate that will be specified below. Our simulation experiments indicate that in 
applications with moderate and large samples Vt could be even set equal to 0. 

In practice the bandwidths bki can be chosen by minimization of a cross-validation 
criterion like 

T 

II S t - u t u' t f, 

t=l 

with respect to all b M G Bt, 1 < k < I < d, where || • || is some norm for a 
square matrix, for instance the Frobenius norm that is the square root of the sum 
of the squares of matrix elements. Like in Patilea and Rai'ssi (2010), the theoretical 
results below are obtained uniformly with respect to the bandwidths bki G Bt and 
this provides a justification for the common cross-validation bandwidth selection 
approach in the framework we consider. 

20 



Let us now introduce the following adaptive least squares (ALS) estimator 

9 als = S^vec (£x) > 

with 

T T 

% = T- 1 ^A > t _ 1 A > ;_ 1 ®S~ 1 , and Sx = T^ 1 ^ E^ 1 X t X t '_ 1 . 

The ALS residuals, proxies of the infeasible GLS residuals, are defined as e t = 
H^ l X t — if f ~ 1 (A t / _ 1 ®/d)^AL5, and the adaptive autocovariances and autocorrelations 

T 

r AL s(h) = T ALS (h) = T- 1 MU, R A Ls(h) = S^Aisih)^ 1 , 

t=h+l 

where S e = Diag{d" e (l), . . . , a e (d)}, of (2) = T _1 Ylt=i <Hti an d H t is the nonparamet- 
ric estimator obtained from E t and the identification condition on H t (see Assump- 
tion Al(i)), that is H t = t] /2 . 

Let 7^ z " s = vec{(r^L5(l), • • • , ^ALs( m ))}- Following the notation of the previous 
section, for a given integer m > 1, define the residual autocorrelations 

Pa,™ = vec { (RalsQ), • • • , RALs(mj)} and p^J = ^ LS . 

The main result of this section shows that 7^ i5 an d p^ s are asymptotic equivalent 
to 7^ L5 and p^ 5 . This will allow us to define new portmanteau statistics based on 
the ALS residuals. For this purpose, we need the following assumptions. 

Assumption Al': Suppose that all the conditions in Assumption Al(i) hold 
true. In addition: 

(i) inf rg ( 0i i] A m ,j n (£(r)) > where for any symmetric matrix A the real value 
A m m(^4) denotes its smallest eigenvalue. 

(ii) sup t \\ekt\\& < oo for all k e {!,••-, d}. 
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Assumption A2: (i) The kernel K(-) is a bounded density function defined 
on the real line such that K(-) is nondecreasing on (—00, 0] and decreasing on [0, 00) 
and J R v 2 K(v)dv < 00. The function K(-) is different iable except a finite number 
of points and the derivative K'(-) is an integrable function. Moreover, the Fourier 
Transform of K(-) satisfies J R ^^[^(s)! ds < 00. 

(ii) The bandwidths bki, 1 < k < I < d, are taken in the range Bt = [c m i n bT, c max bT] 
with < c m in < c m ax < 00 and bx + 1/Tb^ 1 — > as T — > 00, for some 7 > 0. 

(iii) The sequence vt is such that Tv\ — >■ 0. 

Below, we say that a sequence of random matrices At, T > 1 is o p (l) uniformly 
with respect to (w.r.t.) b^i G Bt as T — > 00 if sup 1<jfc<Kd sup 6 fc , e s T ||vec (Ap) || — > 0. 
The following proposition gives the asymptotic behavior of variances, autocovari- 
ances and autocorrelations estimators based on the ALS estimator of 9q and the 
nonparametric estimate of the time- varying variance structure S t . The results are 
uniformly w.r.t the bandwidths. 

Proposition 3 // model U.l\) is correct and Assumptions Al' and A2 hold, uni- 
formly w.r.t. b G Bt 

t ,1 
T-^H't^Hi 1 = / (?(r)'®G(r)- 1 dr + o p (l), (5.1) 

% = A 1 + o p (l). (5.2) 

Moreover, given any m > 1, 

THli LS -lg LS } = o p {l) and T^{pi LS -p G m LS }=o p (l), (5.3) 

where p^ LS (resp. p^ LS ' ) stands for any of pf^f and p^^f (resp. p^ 5 and p^^f). 

This asymptotic equivalence result allows us to propose portmanteau test statis- 
tics adapted to the case of time-varying variance. Consider the Box-Pierce type 
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statistic 



h=l 

= T^ LS ' (l m s f^ 5 (0) ® f^(0)) % 



ALS 
m ' 



and 

^b,m — 1 Pb,m Pb,n 

Consider also the Ljung-Box type statistics 



Q A a L m S = t 2 J2( t ~ h)-hr (rU LS (h)rjl s (o)r ALS (h)r2l s (o) 



h=l 

and 

m 

OH 3 = T 2 - h )'^ {r'ALsWALs(h)) . 

h=l 

The following theorem is a direct consequence of (13. 7\i and Proposition [3] and hence 
the proof is omitted. 



Theorem 5.1 Under the assumptions of Proposition^ the statistics Q^m' Qbm 
and Qam) Qbm converge in distribution to 

UK LS ) = ^t S Ul (5.4) 
i=i 

as T — > oo, where S^ LS = (Sf s , . . . , is the vector of the eigenvalues of T, GLS , 

and the Ui's are independent Af (0,1) variables. 

To compute the critical values of the adaptive portmanteau tests, we first obtain 
a consistent estimator of given in (13. 3p by replacing Aqi, . . . , A 0p by their ALS 
estimators in K and using (15. ip . Next we consider the estimate of Ai given in (15. 2p . 
Plugging these estimates into the formula (13. 8p . we obtain a consistent estimator of 
yjGLS w ith eigenvalues 8^ s = (Sf s , . . . , 6°p )' that consistently estimate b~ m LS - 
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There are several important particular cases that could be mentioned. In the 
case of a VAR(O) model (i.e., the process (ut) is observed), TF LS = I d 2 m (see Propo- 
sition [T]) and hence the asymptotic distribution of the four test statistics in Theorem 
15. II would be Xd2 m , that means independent of the variance structure given by £(■). 
In the general case p > 1 where the autoregressive coefficients Aoi, i = 1, ...,p have 
to be estimated, the matrix I& m — YP LS being positive semi-definite, the eigenvalues 
Si ls , • are smaller or equal to 1. Since, in some sense, the unconditional het- 

eroscedasticity is removed in the ALS residuals, one could expect that the Xd2( m - P ) 
asymptotic approximation is reasonably accurate for the ALS tests. Example 13.11 
indicates that this is may not the case, the asymptotic distribution we obtain for the 
ALS portmanteau statistics can be very different from the Xrf2( m _ p ) approximation 
when the errors are heteroscedastic. Finally note that Patilea and Rai'ssi (2010) 
pointed out that using the adaptive estimators of autoregressive parameters instead 
of the OLS estimators lead to a gain of efficiency, so that it is advisable to compute 
the kernel smoothing estimator of the variance function £(•) at the estimation stage. 
Therefore since the kernel estimator of the variance T, t is available for the validation 
stage, the ALS tests are not more complicated than the OLS tests to implement. 

Let us also point out that the eigenvalues 5f s ,...,S^i s will not depend on the 
variance structure when £(•) = a 2 (-)Id (in particular in the univariate case), what- 
ever the value of p is. Moreover, using the arguments of Box and Pierce (1970), see 
also Brockwell and Davis (1991, pp. 310-311), one can easily show that for large 
values of m, the law of U(S^ ) is accurately approximated by a Xd 2 (m-p) distribu- 
tion. However, in the general the multivariate setup the asymptotic distribution in 
(15. 4p depend on the variance function £(■). 
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6 Modified portmanteau statistics with standard chi- 
square asymptotic distributions 

In the previous sections we considered portmanteau tests for which the asymptotic 
critical values are given by weighted sums of chi-squares in the general VAR(p) 
case. Using a suitable change of our quadratic forms one can propose alternative 
portmanteau test statistics with chi-squared asymptotic distributions under the null 
hypothesis. This type of modification was already proposed in the recent time series 
literature but in different contexts. 

First note that as remarked above when testing that the observed process is 
uncorrelated (p = 0) and using the standard portmanteau statistic (14. ip we obtain 
a non standard asymptotic distribution. Then following the approach of Lobato, 
Nankervis and Savin (2002) we consider the modified portmanteau test statistic 

gOL8 = T ^OLS' (At)"^, 

where A^ u = I m ® Eg«2 with Ec®2 defined in equation (14.31) . The invertibility 
of Aj^ u is guaranteed asymptotically by our assumptions. In view of Proposition 
[T] it is clear that under the null hypothesis of uncorrelated observed process, the 
asymptotic distribution of the Q°J^ S statistic is X% m - Recall that this kind of statistic 
correction is not necessary to obtain a standard asymptotic distribution for the 
adaptive portmanteau tests when the non correlation of the observed process is 
tested. 

This approach can be generalized to the case of VAR(p) models with possibly 
p > using the approach of Katayama (2008) for building tests with standard 
asymptotic distributions. In this part we take p < m < T . Let us introduce 

n OLS = $ /$' fA"'")" 1 $ V 1 $' (A^)' 1 
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D GLS = A e,8 f A efi' A eA \e,e' 
m m I m m J m 

so that (I d 2 m - D° LS )$ m = and (I d 2 m - D% LS )A e J > = 0. From the proof of 
Proposition [1] (equation (112. 17p ). it is easy to see that 

(/*m - D° m LS )T^ LS = (I<p m - DZ LS )Thl + o p (l) 

where T 1//2 c^ is asymptotically normal of mean and variance A^[ u . Deduce that 

(I d * m -DZ LS )Th° LS ^Af(0,V), 

where V = {I& m - Dg LS )A% u (I d 2 m - Dg LS )' . Now, notice that 

(Im - n OLS )\ u > u = A"'" - $ /$' (A^V 1 ® =A w,u d^a - n OLS y 

From this and the fact that I d 2 m — Dg LS is a projector, deduce that the matrix AV 
is idempotent, where 

A = (Id? m - D m )'{A^)~ 1 {I d 2 m - D° LS ). 

Moreover, since <3> m is of full column rank d 2 p, it is easy to see that the rank of A is 
d 2 (m — p). A classical result in multivariate data analysis implies 

TOOLS' ( t r>OLS\r/ \u,u\-l( t ri OLS\£,OLS . „2 (n -t\ 

T 7™ {Id?m-D m ) (A m ) [l d 2 m -D m )7 m ^XdHm-p)- I 6 - 1 ) 
Similarly we obtain 

Vffim ~ DZ LS )T^jg LS =► AT(0, J d2m - Dg LS ) (6.2) 

and we deduce 

Tjg LS '(I d 2 m - D° LS )jg LS =► j& (m _ p) . (6 . 3) 

The matrices D^ LS and A^ M could be estimated as suggested in Section HI see 
comments after equation (14. 6ft . and hence a modified portmanteau test statistic 
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based on the OLS estimates 7„ and having standard chi-square critical values 
could be derived from equation (16.11) . On the other hand, using a nonparametric 
estimate of H t one could easily estimate D^ s , see Proposition [3] and the comments 
after Theorem 15.11 Moreover, Proposition [3] allows us to replace 7^ L5 with J^ LS 
and thus to introduce an adaptive portmanteau test with a modified statistic and 
standard chi-square critical values based on equation ( 16. 3p . Clearly one can consider 
similar modification for Ljung-Box type statistics. 

The chi-square critical values are certainly more convenient for portmanteau 
tests. Moreover, in section [7] we provide evidence that the test based on the statistic 
(16. ip could be more powerful, in the Bahadur slope sense, than the OLS estimates 
based test based on the Q^ LS statistic investigated in Theorem 11.11 However, it 
is not necessarily true that the modified procedures presented in this section are 
preferable in applications. Indeed, the empirical evidence presented in Section [9] 
shows that test statistics like in (16.11) and (16.31) are unstable and induce bad levels 
even with series of few hundred observations. 



7 Testing for autocorrelation in heteroscedastic se- 
ries: some theoretical power comparisons 

In this part we carry out some theoretical power comparisons for the tests we consid- 
ered above in the important case where the non correlation of the observed process 
X t = Ut is tested. The case p > 1 will be considered elsewhere. On one hand we 
compare the classical Box-Pierce portmanteau test and modified quadratic forms of 
OLS residual autocorrelations based test introduced in section El On the other hand 
we compare ALS and OLS residual autocorrelations based portmanteau tests. For 
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this purpose we use the Bahadur slope approach that we briefly recall here. Let Qa 
denote a test statistic and, for any x > 0, define Qa(x) = — logPo(Q^ > %) where 
Po stands for the limit distribution of Qa under the null hypothesis. Following Ba- 
hadur (1960) (see also van der Vaart 1998, chapter 14), consider the (asymptotic) 
slope ca(q) = 2 liniT-i-oo T~ 1 Qa{Qa) under a fixed alternative Hi such that the limit 
exists in probability. The asymptotic relative efficiency of the test based on Qa 
with respect to a competing test based on a test statistic Qb is then defined as the 
ratio AREa,b(q) = ca{q)/cb(q)- A relative efficiency AREa,b{q) > 1 suggests that 
the test given by Qa is better suited to detect "Hi because the associated p— values 
wanes faster or equally faster compared to the p— values of the test based on Qb- 

For the sake of simplicity we restrict our attention to the BP statistics and 
consider the case where one tests the non correlation of the observed process, while 
the underlying true process is the autoregressive process of order 1 

ut = But-! + H t e t , (7.1) 

where det(J d - Bz) ^ for all \z\ < 1 and B ^ 0. We keep the notation E(X t X' t ) = 
E(u t u' t ) = S t and we introduce E(H t ete' t H' t ) = H t H' t := S t . Under the alternative 
hypothesis (17. ip we have the relationship 

oo 

E t = Y t B%- i B i '. (7.2) 

i=0 

Using similar arguments to that of the proofs of Lemmas 112.11 to 112.31 in the Ap- 
pendix, deduce that 

T ,1 

TolsW = T- 1 u t<-h = B h / S(r)rfr + o p (l) (7.3) 
t=h+i Jo 

and 



t GL s(h) = T- 1 H- l u t u' t _ h H;}' h = / 
+ — Jo 



G(r)- l B h G(r)dr + o p (l). (7.4) 



t=h+l 
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Using basic properties of the vec(-) operator and the Kronecker product we obtain 
T~ X QZ LS = & | Im ® £ S(r)rfr ® Qf ' Z(r)dr^J \b + o p (1) 

T- 1 Q^ LS = fi'|/ m <8) (Jx(r)dr®I^(jE(r)®E(r)d^ (Jz(r)dr®I^ B + o p (l) 
and 

T- l Qf^ = B' |/ m ® QT 1 G(r)' (8) G(r)- 1 ^ J B + o p (l), 
with i e {a, b} and B = vec {(B 1 , . . . , 5 m )}. 

Proposition 4 (i) If Assumption Al /lo/eZs £nte and £/ie observations follow the 
model ( f7.ip . £/ie asymptotic relative efficiency of the portmanteau test based on Q°J^ S 
with respect to the portmanteau tests based on Q^ s is larger or equal to 1. 

(ii) Suppose that £(•) = o~ 2 (-)Id where cr(-) is some positive scalar function. 
Suppose that Assumptions Al' and A2 holds true and the observations follow the 
model ( [7. ij) . Then asymptotic relative efficiencies of the portmanteau test based on 
Qm LS w ith respect to the portmanteau tests based on Qm LS or Q°^ S are l ar 9 er or 
equal to 1. 

In the first part of Proposition H] the result is obtained without additional restric- 
tion on £(•) while in the second part we impose £(•) = a 2 (-)Id which is for instance 
true in the univariate case (d = 1). In the general multivariate case the portmanteau 
test based on ALS residual autocorrelations does not necessarily outperforms, in the 
sense of the Bahadur slope, the tests based on the OLS residual autocorrelations 
considered above. 
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8 Extending the scope: testing the order of a het- 
eroscedastic co-integration model 

Consider the case of a unit root multivariate process (X t ) with time- varying volatil- 
ity, see for instance Cavaliere, Rahbek and Taylor (2010) or Boswijk (2010). With 

Z ot := X t — Xt-i the model (II. ip can be rewritten in its error correction form 

p 

Z ot = n iX t _i + ^ H-oiZot-i+i + ih (8.1) 

i=2 

u t = H t e t . 

The matrices Hoi are functions of the A^s, and such that the assumptions of the 
Granger representation theorem hold (see for instance Assumption 1 of Cavaliere, 
Rahbek and Taylor (2010)), il i = ao(3' where the d x s-dimensional matrices a 
and Po a re identified in some appropriate way (see e.g. Johansen 1995, p. 72, for 
the identification problem). If p = 1 the sum in (18. ip vanishes. In this section we 
follow Cavaliere, Rahbek and Taylor (2010) and we slightly strengthen Al assuming 
that (et) is iid. Then it follows from their Lemma 1 that (Xt) have a random walk 
behavior and also that (/3' X t ) is stable. By analogy with the homoscedastic case, the 
number s of independent linear stable combinations in ((3' X t ) is the cointegrating 
rank (see section 2.3 of Cavaliere, Rahbek and Taylor 2010, for a detailed discussion 
on the concept of cointegration in our framework). If s = the process (X t ) is not 
cointegrated and the procedures described in the previous sections apply directly to 
the process (Zot). Many contributions in the literature that considered the standard 
homoscedastic framework pointed out that the choice of the lag length is important 
for the contegrating rank analysis, see e.g. Boswijk and Franses (1992, section 4). It 
seems reasonable to imagine that a similar remark remains true with a time- varying 
variance. 
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To describe the estimation procedure of (18. ip . let us define Zu(P) = (X^fl, Z' Qt _ l: 
. . . , Z' ot _ p+l )' for any d x s-matrix /3 and rewrite model (18.11) under the form 

z ot = (z lt (p y ® i d )& + Ut , 

where $ — vec {{ a o, ^-02, ■ ■ ■ , ^-op)}- The estimator of the long run relationships (3 
can be obtained using the reduced rank regression method introduced by Anderson 
(1951). Cavaliere et al (2010) showed that in our framework 

T0-0) = O P (1). (8.2) 

Now, let us define 

$oLs(P) = £ z 1 1 (f3)vec (± Zo (f3) 

where 

T T 

t Zl {P)=T- 1 Y,Zu{P)Z lt {P)'®I d and £ Zo (P) = T~ l ^ Z ot Z u ((3)', 
t=i t=i 

and similarly to (12. 2p let us introduce 



with 

T T 



i=l i=l 



where the volatility E t is assumed known. Next, for any fixed /3, let us define the 
estimated residuals 

Ut{P) = Z ot - (Z lt (J3)' ® I d )$OLs(P), 

et(P) = H~ l Z ot - H-\Z lt (py <g> I^GLsiP) 
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and the corresponding estimated autocovariance matrices 

T T 

f Ls(h,(3) = T- 1 UtWt-hW)' and t GLS {h, (3) = T" 1 ^ e t (f3)e t „ h ((3)' . 

t=h+l t=h+l 

From ( 18.2ft we obviously have 

f O Ls(hJ)=f u OLS (h,(3 ) + o p (T- 1 2) and f Gi5 (/i J) = ?gls% A>) + o p (T^). 
Defining 

T 

S°(/3) = ^^0n i (/3)n l (/3) / , 

8=1 

it is also obvious that 

t° t 0) = nm+o P (T^). 

It is clear now that one can treat /3 as known and, following the lines of the pre- 
vious sections, one can use T Ls(h,f3) and the ALS version of T GLS (h,(3) to build 
portmanteau tests for checking the order p of the model (18. ip . 



9 Monte Carlo experiments 

In the sequel L£>^ i5 and LB^ S will denote the Ljung-Box type portmanteau tests 
based on the adaptive approach with non standard distributions ( 14. 2 p and ( 15. 4p . 
For the sake of brevity, only the results obtained with the test statistic Qam w ^ 
be reported. Moreover, since we found similar results for the BP and LB tests, 

we only report on LB tests. The LB tests based on modified statistics which are 

- — ols - — ALS 

built using the results in Section [6] are denoted by LB m and LB m . If we assume 
that the volatility function is known, one can also build portmanteau tests using the 
result in (I3.14p in a similar way to the adaptive portmanteau tests. These infeasible 
tests denoted by LB^ and LB m will serve as a benchmark for comparison with 

• ALS 

the ALS tests LB^ and LB m . It is clear that the asymptotic critical values of 
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the ALS tests are the same as the critical values of the GLS tests. In this section 
we investigate by simulations the finite sample properties of the ALS and GLS 
portmanteau tests and we compare them with the OLS estimation-based tests. In 
the next section we study the model adequacy of two real data sets: the US energy 
and transportation price indexes for all urban consumers on one hand and the US 
balances on merchandise trade and on services on the other hand. 



9.1 Empirical size 

Our Monte Carlo experiments are based on the following Data Generating Process 
(DGP) specification 

\ / 



( y \ ( 



\X 2t j 



0.3 -0.3 
-0.1 
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a 



7 



V 



X-2t-2 



\ ( \ 



\ u * 7 



(9.1) 



where a = in the empirical size part of the study and a = —0.3 in the empirical 
power part. The autoregressive parameters are such that the stability condition hold 
and are inspired from the ALS estimation obtained for the U.S. balance on services 
and merchandise trade data (see Table [5] below) . In the case of smooth variance 
structure we consider 

( (1 + 7Tir)(l + w 2 ) w(l + 7Tir)5(0.1 + vr 2 r)5 ^ 

£(r) = 

, w{l + 7Tir)2(0.1 + 7l2r)2 (0.1 + 7T2r) 

where we take 7Ti = 250 and 7r 2 = 5. In order to investigate the properties of the 
tests when a volatility break is present we also consider the following specification 



(9.2) 



7 



/ 
V 



(6 + A(r))(l + zu 2 ) zu(Q + /i(r))s(0.5 + / 2 (r))i 
w(0.5 + / 2 (r))3(6 + A(r))3 (0.5 + / 2 (r))(l + p 2 ) 
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(9.3) 
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with /i(r) = 54 x l(r>i/2)( r ) an d f2(f) = 3 x l( r >i/2)(r). In this case we have a 
common volatility break at the date t = T/2. In all experiments we fix w = 0.2. 
These volatility specifications are inspired by the real data studies we consider in the 
next section. For instance to fix the (1, 1)— component of the specification (I9.2p we 
noticed that the last estimated variances with the balance services and merchandise 
trade data are all greater than 200. In the energy-transport price indexes data 
some of the last estimated variances of the first component are even greater than 
300. The amplitudes of the functions f\(-) and /2(-) in the volatility specification 
with an abrupt break defined in (19. 3p were calibrated close to the means of the first 
T/2 estimated volatilities and of the T/2 last volatilities for the balance services 
and merchandise trade data. To assess the finite sample properties of the tests 
under comparison when the errors are stationary, we also considered standard i.i.d. 
Gaussian error processes. For each experiment iV = 1000 independent trajectories 
are simulated using DGP (I9TT]) . Samples of length T = 50, T = 100 and T = 200 
are simulated. 

We first study the empirical size of the tests taking a = 0, and adjusting a VAR(l) 
model to the simulated processes. The portmanteau tests for the non correlation 
of the error terms are applied using m = 5 and m = 15 at the asymptotic nominal 
level 5%. The results are given in Tables [TJ [2] and [3j Since N = 1000 replications are 
performed and assuming that the finite sample size of the tests is 5%, the relative 
rejection frequencies should be between the significant limits 3.65% and 6.35% with 
probability 0.95. Then the relative rejection frequencies are displayed in bold type 
when they are outside these significant limits. Note that the distributions (14. 2 p 
and ( 15. 4p are given for fixed m, while the Xd2( m - P ) approximation (see discussion 
after Theorem 14.11 above) should be accurate only for large m. Therefore we only 
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comment the results for small and moderate samples (T = 50 and T = 100) for 
the standard portmanteau tests with m = 5, while the results for large samples 
(T = 200) are considered when m = 15 is taken. 

In Table [T]i.i.d. standard Gaussian errors are used, that means E(-) = I2. In this 
simple case the relative rejection frequencies of the tests converge to the asymptotic 
nominal level. In general we do not remark a major loss of efficiency of the LB^ LS 
test when compared to the standard test. Therefore one can use the LB* LS test 111 
case of doubt of the presence of unconditional heteroscedasticity. The same remark 
can be made for the LB^ L test when m is small. However we note that the LB^ LS 
is oversized for small samples and when m is large. It also appears that the tests 
based on modified statistics are oversized when m is large. This can be explained 
by the fact that the matrices A^' A e J* and $^(A^ n ) _1 $ m are difficult to invert in 
such situations. 

In Table |2] heteroscedastic errors with an abrupt volatility break are considered, 
while the trending specification (19.21) is used for the volatility in Table O In line with 
the theoretical the relative rejection frequencies of the ALS, GLS and OLS tests con- 
verge to the asymptotic nominal level. As expected the standard portmanteau test 
is not valid. In general it emerges from this part that the LB^ LS and LB^ LS tests 
have similar results. Then it seems that estimating the volatility entails no major 
loss of efficiency when building this kind of test. We did not found clear advantage 
for the LB^ S when compared to the LB^ LS in the presented experiments. However 
note that the asymptotic distribution (14. 2 p of the standard portmanteau statistic 
seems estimated with less precision than the asymptotic distribution of the ALS 
portmanteau statistic. For instance we see in Table H] that for m = 5 the standard 
deviations of the ALS weights are lower or equal to the standard deviations of the 
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OLS weights. We found that this difference is more marked for m = 15 which may 
lead to problems for the control of the error of first kind for the LB° LS as already 
noted in the homoscedastic case (see Table [T] for m = 15). Other simulation results 
not reported here show that the LB^ LS test can be oversized when m is large as 
in the homoscedastic case with m — 15 and we noted that the estimation of the 
weights requires a relatively large number of observations for the OLS approach. A 
possible explanation is also that Sf ls G [0, 1] while 5f s G [0, oo) for % G {1, . . . , d 2 m}, 
which may proceed more instable estimation for Sf s in many cases. In addition the 
energy-transportation price indexes example below show that the estimation of the 
weights may be problematic in the OLS case. Therefore we recommend to choose 
small m when the samples are small and use large m only when the samples are 
large despite the asymptotic results hold true for fixed m when the LB° LS is con- 
sidered. We again note that the tests with modified statistics are in general clearly 
oversized. A possible explanation is that White (1980) type correction matrices are 
inverted in the modified portmanteau statistics which may lead to oversized tests as 
pointed out in Vilasuso (2001) in the stationary case. We can conclude that with 
a data generating process close to our simulation design the LB^ LS test controls 
reasonably well the error of the first kind in all the situations we considered. 

9.2 Empirical power 

In the empirical power part of this section, we examine the ability of the different 
tests to detect underspecified autoregressive order. The power investigation is real- 
ized in the Bahadur sense, that is the sample size is increased while the alternative 
is kept fixed. More precisely we set a = —0.3, and we adjusted a VAR(l) model to 
the simulated VAR(2) processes with T = 50, 100, 200, 300. We simulated N = 1000 
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independent trajectories using DGP (19.11) with standard Gaussian innovations and 
heteroscedastic volatility specifications (19. 2\i and (19. 3p . The non correlation of the 
error process is again tested at the asymptotic nominal level 5%, but taking m = 10 
in all the experiments. In Figure [3] we consider the homoscedastic case, errors where 
an abrupt volatility shift is observed and errors with smoothly varying variance. 

When the variance is constant it appears that the tests with non standard distri- 
bution have similar power to the standard test when the errors are homoscedastic. 
Therefore we again note that there is no major loss of efficiency when the tests 
with modified distribution are used while the variance is constant. The tests with 
standard distribution may seem more powerful when the samples are small, but this 
mainly comes from the fact that these tests are oversized. When the errors are 
heteroscedastic the standard test seems more powerful than the other tests. How- 
ever the standard test is oversized and in this case and the comparison is again not 
fair. A similar comment can be made when the tests with modified distribution are 
compared to the tests with standard distribution. It emerges that the LB^ LS test 
is more powerful than the LB^ . The relation between the powers of LB m and 

• ALS 

LB m is not clear. Finally we can remark that in the presented experiments the 
GLS type tests are not necessarily more powerful than the ALS tests. 

The results of section [7] are also illustrated. To this aim N = 1000 independent 
trajectories are simulated using a VAR(l) DGP with A m = —0.3/2- We consider 
heteroscedastic errors with trending behavior volatility specification: 



' (1 + TTir) 



\ (l+7rir) j 
where we take 7Ti = 150, and a volatility with an abrupt shift at T/2: 
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where fi(r) = 10 x l( r >i/2)( r )- The lengths of the simulated series are T = 
50, 100, 200. The non correlation of the observed process is tested at the asymp- 
totic nominal level 5% taking m = 10. From Figure H] the LBols test may appear 
more powerful than the LBals test when the samples are small. However this comes 
from the fact that the LBols test is strongly oversized in small samples. In accor- 
dance with the theoretical we see that the LBals test clearly outperform the LBols 
and LBols as the samples become large. 

10 Illustrative examples 

For our real data illustrations we use two U.S. economic data sets. First we consider 
the quarterly U.S. international finance data for the period from January 1, 1970 to 
October 1, 2009: the balance on services and the balance on merchandise trade in 
billions of Dollars. The length of the balance data series is T = 159. We also consider 
monthly data on the U.S. consumer price indexes of energy and transportation for 
all urban consumers for the period from January 1, 1957 to February 1, 2011. The 
length of the energy-transportation series is T = 648. The series are available 
seasonally adjusted from the website of the research division of the Federal Reserve 
Bank of Saint Louis. 

10.1 VAR modeling of the U.S. balance trade data 

In our VAR system the first component corresponds to the balance on merchandise 
trade and the second corresponds to the balance on services trade. From Figure [5] it 
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seems that the series have a random walk behavior. We applied the approach of unit 
root testing proposed by Beare (2008) in presence of non constant volatility using the 
Augmented Dickey- Fuller (ADF) test for each series. The ADF statistic is 0.72 for 
the merchandise trade balance data and is —0.15 for the services balance data. These 
statistics are greater than the 5% critical value —1.94 of the ADF test, so that the 
stability hypothesis have to be rejected for the studied series. Furthermore we also 
applied the Kolmogorov-Smirnov (KS) type test for homoscedasticity considered by 
Cavaliere and Taylor (2008, Theorem 3) for each series. The KS statistic is 3.05 for 
the merchandise trade balance data and is 7.62 for the services balance data. These 
statistics are greater than the 5% critical value 1.36, so that the homoscedasticity 
hypothesis has to be rejected for our series. Therefore we consider the first differences 
of the series to get stable processes, so that the evolutions of the U.S. balance data 
are studied in the sequel. From Figure [6] we see that the first differences of the 
series are stable but have a non constant volatility. We adjusted a VAR(l) model to 
capture the linear dynamics of the series. The ALS and OLS estimators are given 
in Table [5j The standard deviations into brackets are computed using the results 
(12. 3p and (j2.4p . In accordance with Patilea and Rai'ssi (2010), we find that the 
ALS estimation method seems better estimate the autoregressive parameters than 
the OLS estimation method, in the sense that the standard deviations of the ALS 
estimators are smaller than those of the OLS estimators. The bandwidth we use for 
the ALS estimation, b = 7.67 x 10~ 2 , is selected by cross-validation in a given range 
and using 200 grid points (Figure [7]) . 

A quick inspection of Figures [8] and [9] suggests that the OLS residuals have a 
varying volatility, while the stationarity of the ALS residuals is plausible. Thus it 
seems that the volatility of the error process (tt t ) is satisfactorily estimated by the 
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adaptive method. Now we examine the possible presence conditional heteroscedas- 
ticity in the ALS residuals. From Figure [10] we see that the autocorrelations of the 
squared ALS residuals components are not significant. In addition we also consid- 
ered the ARCH-LM test of Engle (1982) with different lags in Table [6] for testing the 
presence of ARCH effects in the ALS residuals. It appears that the null hypothesis 
of conditional homoscedasticity cannot be rejected at the level 5%. These diagnos- 
tics give some evidence that the conditional homoscedasticity assumption on (ej) is 
plausible in our case. To analyze the changes of the variance of the OLS error terms, 
we plotted the estimated variances and cross correlation of the components of the 
error process in Figures [TT1 and [T2l It appears from Figure [11] that the variance of 
the first component of the residuals does not vary much until the end of the 90's 
and then increase. Similarly the volatility of the second component of the residuals 
does not vary much until the end of the 80's and then increase. From Figure [T2l we 
also note that the correlation between the components of the innovations seems to 
be positive until the beginning of the 90's and then become negative. 

Now we turn to the check of the goodness-of-fit of the VAR(l) model adjusted 
to the first differences of the series. To illustrate the results of Proposition [1] we 
plotted the ALS residual autocorrelations in Figures [T3] and [J4] and the OLS residual 
autocorrelations in Figures [15] and [16] where we denote 

R ° Ls(h) - — mm — and Rals{h) ~ — mm — • 

The ALS 95% confidence bounds obtained using ( 13. 14ft and ( 15. 3ft are displayed in 
Figures [13] and [TU In Figures [15] and [16] the standard 95% confidence bounds 
obtained using (13. 9p and the OLS 95 % confidence bounds obtained using (I3.13P are 
plotted. We can remark that the ALS residual autocorrelations are inside or not 

10 



much larger than the ALS significance limits. A similar comment can be made for 
the OLS residual autocorrelations when compared to the OLS significance limits. 
However we found that the OLS significance limits can be quite different from the 
standard significance limits. This can be explained by the presence of unconditional 
volatility in the analyzed series. In particular we note that the R"ols(5) is far from 
the standard confidence bounds. We also apply the different portmanteau tests 
considered in this paper for testing if the errors are uncorrelated. The test statistics 
and p-values of the tests are displayed in Tables [7] and [HJ It appears that the p- 
values of the standard tests are very small. Therefore the standard tests clearly 
reject the null hypothesis. We also remark that the p-values of the modified tests 
based on the OLS estimation and of the adaptive tests are far from zero. Thus in 
view of the tests introduced in this paper the null hypothesis is not rejected. These 
contradictory results can be explained by the fact that we found that the distribution 
in (14. 2p is very different from the Xd^(m-p) standard distribution. For instance we 
obtained swp ie ^ d 2 m y j<5° i,s j = 11.18 for m = 15 in our case. Our findings may 
be viewed as a consequence of the presence of unconditional heteroscedasticity in 
the data. Since the theoretical basis of the standard tests do not include the case 
of stable processes with non constant volatility, we can suspect that the results of 
the standard tests are not reliable. Therefore we can draw the conclusion that the 
practitioner is likely to select a too large autoregressive order in our case when using 
the standard tools for checking the adequacy of the VAR model. From Table [S] we 
see that the OLS and ALS statistics are quite different. We also noted that the 
weights (not reported here) of the sums in (14. 2 p and in (15.41) are quite different for 
our example. 
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10.2 VAR modeling of the U.S. energy-transportation data 

In this example the first component of the VAR system corresponds to the trans- 
portation price index and the second corresponds to the energy price index. We first 
briefly describe some features of the energy-transportation price indexes. In Figure 
[T71 we again see that the studied series seems to have a random walk behavior and 
then we again consider the first differences of the series. The KS type statistic is 
7.05 for the energy price index and is 6.81 for the transportation price index, so that 
the homoscedasticity hypothesis has to be rejected for these series. From Figure 
[IS] we see that the first differences of the series are stable but have a non constant 
volatility. We adjusted a VAR(4) model to capture the linear dynamics of the se- 
ries. The results in Table indicate that the ALS approach is more precise than the 
OLS approach for the estimation of the autoregressive parameters. The bandwidth 
obtained by the cross-validation for the ALS estimation is b = 9.53 x 10~ 2 (Figure 
[TP"]) . From Figure |2~T1 we see that the OLS residuals seem to have a varying volatility. 
The stationarity of the ALS residuals is plausible (Figure 120]) and ARCH-LM tests 
(not reported here) show that the conditional homoscedasticity of the ALS residuals 
cannot be rejected. From Figure |2"21 we see that the shape of the variance structure 
of the components of the OLS residuals are similar. More precisely it can be noted 
that the variance of the components of the OLS residuals is relatively low and seems 
constant until the beginning of the 80's. The OLS residual variance seems to switch 
to an other regime where the variance is increased but constant from the beginning 
of the 80's to the end of the 90's. The volatility of the OLS residual variance seems to 
increase from the end of the 90's. From Figure [12] we also note that the components 
of the OLS residuals seem highly correlated. 

Now we check the adequacy of the VAR(4) model. The ALS and OLS residual 
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autocorrelations are given in Figures 124] [25] and [26] [27] The OLS residual autocor- 
relations with the standard bounds are given in Figures [28] and [29] From Figures 
[241 and [25] it can be noted that the ALS residual autocorrelations are inside or not 
much larger than the ALS significance limits. The OLS confidence bounds seems 
not reliable since we remark that some confidence bounds (corresponding to the 
Rq LS (2)'s) are unexpectedly much larger than the OLS residual autocorrelations. 
From Figures [28] and [29] it can also be noted that some of the OLS residual auto- 
correlations are far from the standard confidence bounds. However considering the 
standard confidence bounds in presence of non constant variance can be misleading 
in view of our theoretical findings. In addition we also remark that some standard 
confidence bounds (corresponding to the Rj^ LS (2), R^sity and R^ LS (8), -Rols(8)) 
are unexpectedly far from the residual autocorrelations. The non correlation of the 
residuals is tested using the different portmanteau tests considered in this paper. 
The test statistics and p- values of the tests are displayed in Tables [10] and [11] The 
standard tests again lead to select complicated models since the adequacy of the 
VAR(4) model is rejected. We also remark that the p-values of the OLS tests are 
very large. In fact we found that the OLS tests do not reject the hypothesis of the 
adequacy of a VAR(l) model for the studied series. From Table [121 we see that some 
of the estimated weights for the asymptotic distribution of the standard statistic 
take very large values. It appears that the OLS method seems not able to esti- 
mate correctly the asymptotic distribution of the standard statistics and may be 
suspected to have a low power in this example. Then the OLS portmanteau test 
seem not reliable in this example on the contrary to the ALS portmanteau test. In- 
deed it can be noted that the asymptotic distribution seems well estimated. Finally 
note that we found that the estimators of the A^f A^f and <£>^(A^ n ) _1 $ m are not 
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invertible, so that the LB m and LB m are not feasible in this example. In view 
of the different outputs we presented, it seems that the LB^ LS test is the only test 
which give reliable conclusions in this example. This may be explained by the fact 
that the variance strongly change in this second example when compared to the first 
example and hence make the tests based on inverted matrices or the tests which do 
not exploit the variance structure difficult to implement. 



11 Conclusion 

In this paper the problem of specification of the linear autoregressive dynamics of 
multivariate series with deterministic but non constant volatility is studied. Consid- 
ering such situations is important since it is well known that economic or financial 
series commonly exhibit non stationary volatility. The unreliability of the standard 
portmanteau tests for testing the adequacy of the autoregressive order of VAR pro- 
cesses is highlighted through theoretical results and empirical illustrations. From 
the statistical methodology point of view, the main contribution of the paper is 
two-fold. In the setup of a stable VAR with time-varying variance, (a) we show how 
to compute corrected critical values for the standard portmanteau statistics imple- 
mented in all specialized software; and (b) we propose new portmanteau statistics 
based on the model residuals filtered for the non constant variance. Moreover, we 
provide some theoretical and empirical power comparisons of the two approaches 
and we show that they are well-suited for replacing the usual test procedures even 
when the volatility is constant. The new portmanteau statistics require the estima- 
tion of the time-varying variance that is done by classical kernel smoothing of the 
outer product of the OLS residuals vectors. Then another contribution of the paper 
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is represented by the fact that our asymptotic results are derived uniformly in the 
bandwidth used in the kernel. This makes the theory compatible with the practice 
where people usually use the data to determine the bandwidth. Our theoretical 
and empirical investigation could be extended to other setups, like for instance the 
co-integrated systems. We briefly mention this extension but a deeper investigation 
is left for future work. 



12 Appendix A: Proofs 

For the sake of a simpler presentation, hereafter we stick to our identification con- 
dition for H t and hence we replace everywhere G(r) by S(r) 1 / 2 . Recall that 



i=0 



i=0 



(see pages 191 and [TT]) . Let us introduce 



and 



= (^i «l^w)'=«:i^{.i)' 



^i-l — ( e t~l 5 • • • 5 e i-m' -^i-l) — ( e t-l5 X t _ X) 



for a given m > 0. To prove Propositions [T] and [2] we need several preliminary results 
that are gathered in Lemmas 112.11 to 112.31 below. 

Lemma 12.1 Under Assumption Al we have 



lim E 



X^X'^J =$^i{lpxp®£(r)}$:=n(r), (12.1) 



lim E 
lim E 

T^oo 



i=0 

1 [Tr]-1 1 [Tr]-1 

-ye fe' 

1 [Tr]-1 1 [Tr]-1 
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;i2.2) 
;i2.3) 



for values r G (0, 1] where the functions gij(-) are continuous. The matrices in \12.2\) 
and U2.3\) are given by 



Q u (r) 



and 



' J m ®E(r) 0«(r) 
\ 



m— 1 



, e»(r) = X;{e m (<+l)ep(l) , ®E(r)}Jf <l 



i=0 



m— 1 
i=0 



\ e;M' n(r) y 

Proof of Lemma 112.11 Statement 112.11 is a direct consequence of Lemma 7.2 in 
Patilea and Raissi (2010). For the proof of ( 112. 2p we write 



i=0 
m— 1 

i=0 
m— 1 

^{e m (* + l)e p (l)' <g> H^H't^K* 



8=0 

m— 1 



J]{e m (* + l)e p (l)' <g> £((t - i - l)/T)}iT'. 

8=0 



Therefore 



m— 1 



lim ^{e m (^ + l)ep(l)' ® E(([Tr] - i - 1)/T)}ZT 

8=0 

m— 1 

^{e m (^ + l)e p (l) / ®E(r)}^'. 



Similarly we have 



8=0 



hm E{vfa ] _ 1 u$ r] _ 1 ) = I m ®V(r) 



(12.4) 



2 Here we make a common abuse of notation because in Assumption Al the matrix-valued 
function £(•) is not denned for negative values. To remedy this problem it suffices to extend the 
function £(•) to the left of the origin, for instance by setting E(r) equal to the identity matrix if 
r < 0. 
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so that using ( 112. ip we obtain the result (112. 2p . The proof of (1 1 2 . 3 [) is similar. □ 



Let us define = vec(T"_ 1 T"l 1 £g> u t u' t ) and v\ = vec(T e t _ 1 T e t '_ 1 £g> ete' t ). The 
following lemma is similar to Lemma 7.3 of Patilea and Rai'ssi (2010), and hence 
the proof is omitted. 



Lemma 12.2 Under Al we have 

T 



T- 1 MutXU 0, (12.5) 



vec\ut^i-i) ~ P 

t=i+l 
T 



T- 1 vecietXU) ^ 0, (12.6) 

t=i+l 

for i > 0, and 

T T 

t- 1 w^A^ftr" 1 E vec { E «iXU)}, ( 12 - 7 ) 

t=m+l t=m+l 
T T 

t- 1 ^AjAtor 1 Yl ( 12 - 8 ) 

i=m+l i=m+l 

In addition we have 

T T T 

i=l t=l t=l 

(12.9) 

T T T 

t- 1 E «? fe^ 1 E *w = ^ T_1 E uec {^( T t-i T ti) ® J 4 • 



*=i *=i t=i 



12.10) 



Lemma 12.3 Under Al we /iai>e 



S 1 = A 1 + 0p (l), (12.11) 

E^ = A 3 + o p (l). (12.12) 
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In addition we also have 



where 
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Proof of Lemma 112.31 Statements (112. lip and (112. 12[) lemma are direct conse- 
quences of Lemma 7.4 of Patilea and Rai'ssi (2010). We only give the proof of (112. 14j) 
and (I12.13p . To P rove flUED, using the well known identity (B <g) C)(D <g) F) = 
(BD) £g> (CF) for matrices of appropriate dimensions, we obtain 

J t -\ru s e t )(rU ® e' t ) j,- 1 = j t -\rurU ® e^) j- 1 . 

From (112. 10p we write 



r-^JT'CTt-iiti®^)^ 



i=l 
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Now let us denote the discontinuous points of the functions <7y(.) by £1,^2, • • • ,£g 



where q is a finite number independent of T. We write 
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Then from (1 1 2 . 3 [) we obtain 
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\ 
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Id?m 

I dp ®Y>{r) 
and S(r) 1 / 2 = G(r) = H [Tr] . Noting that J f _1 ( T t-i 

Cg) et) are martingale differences, 
we obtain the result (j!2.14p using the Lindeberg central limit theorem. Using re- 
lations ( 112. 2h and ( 112. 9ft . the proof of (112.13}) is similar. Finally, the equivalent 
compact expressions of S e and S u can be easily derived using elementary properties 
of the Kronecker product. □ 

Proof of Proposition [JJ First we establish the result (I3.5p . Let us define 

T 

T u (ti) = T~ x u t u' t _ h and c£ = vec{(r u (l), . . . , Yjm))} . 

t=h+l 

Let us first show the asymptotic normality of T 1,/2 (c^ l / , (9ols ~ $o)')'- Note that 

T ( T 

c u m = T^Y,^ti®n t and 6 ls ~ #0 = £f\ T~ l ^(^-x ® «t) 
t=i I t=i 
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with v%L x = (l(o,oo)(* - 1) x u't-v ■ ■ ■ > l(o,oo)(* - m) x i4_ m )'- From (U2.12p we write 
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Then we can obtain from (112. 13p 
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with H u defined in LemmaGJLl Now, define u t (9) = X t -(X' t _ 1 ®I d )9 with 9 e M rf2p . 
Considering ' t i m ^ >LS and as values of the same function at the points #0 an d Qols, 
by the Mean Value Theorem 



-u,OLS 
Im 



90' 



with 9* between 9 OL s and 9 Q Using T 1/2 (9 OL s-Qo) = O p (l) and since du t ^(9)/d9' 
(8) Id), it follows from ([123]) and ffT2~7|) that 
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Im 



= C + limr 1 g £ ® ^ J (§ols - do) + Op(T"*) 
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= c u m + lim T" 1 V —E \u?_i ® * t _i ® Id} (Sols ~ B ) (12.16) 

+ o p (r-3). 



From (112. 2p and using arguments like in the proof of ( 112. 14|) . it is easy to see that 

£% B T ~ 1 E " E { 1 ® ^-1 ®Il} = - ®K r ) dr ®Id + °p( l ) = ~K + Op(l)- 



^The value 0* between #ols an d $0 mav be different for different components of 7„ and c" 



50 



Finally from (112. 16p we have 



t,u,OLS 
Im 



c u m - $ u m (§ Ls - 9 ) + o p (T-z] 



12.17) 



so that it follows from fl 1 2 . 1 5 [) that T l / 2 ^ % LS is asymptotically normal with covari- 
ance matrix 



S n,oL5 = Kl u + $ :a -ia 2 A-i$ : ' _ A U JA-^ - $ u m Az l A u J'. (12.18) 



The proof of (13 .7\i is very similar, here we only present a sketch. Let us define 

T 

T e (h) = T" 1 ^ t - h and c e m = vec {(T e (l), . . . , T € (m))} . 

t=h+l 

Using ( )12.1ip and (112. 14p it can be shown that 

A/^Ar^Ar 1 ), 



T2 



V 



@GLS — $0 

with H e defined in Lemma [12.31 and 

Ai 



12.19) 



/ 



V 



/ 



Ai 

From frrX6|) . (TT2T8]) and since de t -i(9)/d9' = -(X^^ ® i?;/ 1 ) we write 

T 

c£, + lim T _1 £ -J5 {e™ x ® ® if;/ 1 } - + 



7 



,GL5 



t=m+l 

= c ^-A^ GL5 -fl ) + o p (T^). 

By (112. 19p . T 1 / 2 ^ 6 * 1 " 5 is asymptotically normal with covariance matrix X ,E ' GLS ' = 
IfPm ~ ^-m^-\ l ^-m' ■ The particular case where the order of the VAR model is p = 
is an easy consequence of the arguments above in this proof. □ 
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Proof of Proposition [2] For the proof of (I3.13p . we write 

d d 

u it = h ij,t£jt and E{u%) = ^ h\ t 

3=1 3=1 

It is clear from (j!2.4p that 

lim E{u\ Tr] ) = al{r), 

where ofj(r) is the ith diagonal element of E(r). Following similar arguments used 
in Phillips and Xu (2005 p 303) for the proof of Lemma 1 (iii), we write 

T rl 



t=i 

Let us define S" = T" 1 Ylt=i u t u 't an d = Ylt=i ^tu' t - We have using again 
the Mean Value Theorem 

T 



vec(£") = vec(S") + T' 1 \ut{0) 

t=i ^_ 



du t {9) du t {6) 



with 8* is between 9ols and #00 Therefore using du t (6)/d6' = —{X' t _-y <g> Id) and 
from the consistency of 60 ls, we write 



T^vecE n = T^vecE n + oJl 



and 

T rl 



T_1 E^= / °Ur)dr + o p (l), 
t=i Jo 



so that the result follows from the Slutsky lemma. We obtain the expression (13 . 13[) 
noting that 

p° m LS = {i m ®{s u ®s u )- 1 }^Z LS - 

The proof of (I3.14p is similar to that of ( 13 . 1 3[) and hence is omitted. □ 



4 The value 9* may be different for different components of vec(S u ) and vec(S l 
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Proof of Proposition [3] In the following, c, C, ... denote constants with possibly 
different values from line to line. To simplify notation, let b denote the d(d + l)/2 
vector of bandwidths bki, 1 < k < I < d. Below we will simply write uniformly 
w.r.t. b instead of uniformly w.r.t. bki, 1 < k < I < d, and sup b instead of 
sup bfc;g g T i<k<i<d- Here the norm || ■ || is the Frobenius norm which in particular is a 
sub-multiplicative norm, that is ||1-B|| < ||1|| and for a positive definite matrix 
A, \\A\\ < C[A m i n (A)]~ 1 with C a constant depending only on the dimension of A. 
Moreover, = ||A||||S||. 

To obtain the asymptotic equivalences in equation (15. 3p it suffices to notice that 
for all 1 < i < d, of (i) — 1 = o p (l), and to prove 



sup sup \a e [i ) — a e [i) , — v p 

Ki<d b 



ojl) 



12.20) 



and 



sup 

b 



T*{T ALS {h)-T GLS {h)} =o p (l), 
for any fixed h > 1. Let us write 



12.21) 



e t - i t = -Yr t *)u t + tr*{x' t _ x ®h){6GLs-hLs) 
+(zf> - ZP){XU ® i d )0 GLS - e ) 



where \\5^\\ < \\Xt-i\\RT{b) with 



R T (b) = d{ \\0 GL s-0 ALS \\ sup 

Kt<T 



\0gls-0q\\ sup 

Kt<T 



By Lemma [12.4J -(a.b) and given that 0qls~^q — O p (T 1 ^ 2 ) and sup b \\0gls~ &als\ 



,(T 1 / 2 ), we obtain that 



sup£ r (&) = o p (T- 1 ' 2 ). 

b 



12.22) 
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From this and the moment conditions on (X t ) induced by Assumption Al, deduce 
that (112. 20p holds true. On the other hand, 



^ALs(h) - T GLS (h) 



1 T 



t=h+l 
T 



1 T 



[e t - e t )e t _ h 



t=h+l 



+ f £ & ~~ ^( l t-h - e t -h)' 

t=h+l 

--: Ri T {h) + R 2T (h) + R iT (h). 



The terms Rit(K) and R 2 x{h) could be handled in a similar manner, hence we will 
only analyze R 2 T{h). Let us write 



t=h+l 
T 



4es 



t=h+l 



By (I12.22P and the moment conditions on the innovation process (e t ), and the rate 
of convergence of 9gls and I12.4j -(b). it is clear that sup b ||-R22t(^; b) \\ = o p {T~ 1 / 2 ). 
Next let us write 

t , T 



° _1 \ 

* 2 w t e t-ft 



7' 



4 E (s 



4 - g, 3 ) «,4 



t=h+l 
T 



t=h+l t=h+l 

--■ R*\vr{h b ) + Rzi2T{h] b) + iW(fy b) + R 2UT {h] b), 



where, like in Patilea and Rai'ssi (2010), 



O O r v _ _ r v 

Si=St (b) = 2_^w t i& and E t = E t (6) = y j w u G) St- 



t=l 



t=i 



From classical matrix norm inequalities (see for instance Horn and Johnson, 1994), 
we have that for any d x d— positive definite matrices A and B, for a = 1 or a = — 1, 



|A-f-S-f|| <p.(max{||A B ||,||J3 a l|}) 



l+a 
A 2" 



l + a 



12.23) 
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where c a is a constant that depends only on d (by definition A = B° = Id). Applying 
this inequality twice we deduce 



tf> - (E t °)-i 



x (max{||E t ||,||E?||})^ (max{|| |(E?) 2 + u T I d ]-% ||(E?)- 2 ||})i 

Take the norm of R211T, use the inequality in the last display, Lemma [12. 41 (a) below, 
the moment conditions on the innovation process and the condition Tv\ — > 00 
to deduce that sup b ||-R21it(^; b)\\ = o p (T -1 / 2 ). Next, using similar matrix norm 
inequalities, Lemma [12.41 (a) and the Cauchy-Schwarz inequality, 

sup \\R21Mh; b) || < O p (l)sup{^ ||(^. 



b b 



T 

t=h+l 



U t^t-h\ 

1/2/ rp sl/2 



= O p (l)O p (T-% 1 ) (1 £ ||n t eUI| 2 ) 



where for the equality we used Lemma 7.6-(i) in Patilea and Rai'ssi (2010). Deduce 

that sup b ||-R212t(^; b)\\ = o p (T -1 / 2 ). The uniform rate of convergence for i?2i3r(^; b) 

0-1/2 _ 

is obtained after replacing E t — E t by a Taylor expansion of the power —1/2 
function for positive definite matrix, a key and apparently new ingredient we provide 
in section [12.1l below. The reminder term of the Taylor expansion could be controlled 
taking expectation, using Cauchy-Schwarz inequality and Lemma 112.41 (d). The 
term under the integral that represents the first order term of this Taylor expansion 
could be treated similarly to the term EY [Ej — w^E^ UtX^ in the proof of the 
Proposition 4.1 of Patilea and Rai'ssi (2010). That means we use the CLT for m.d. 
sequences indexed by classes of functions, see Bae, Jun and Levental (2010), see also 
Bae and Choi (1999). Here the uniformity to be considered is also with respect to 
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the integration variable v, but this can be handled with very little additional effort, 
like in Patilea and Rai'ssi (2010). The details are omitted. Finally, to derive the 
uniform order R 2 i4r{h\ b), let us write it as 

#214t(/i; b) = R 2 i4T(h; b) - R 2 u T (h; b T ) + R 2 uT(h; b T ) =: r 214T (b) + R 21iT {h\ b T ). 

The term i^ur^^r) is centered and the variance of each element of this matrix 
decreases to zero at the rate o(l/T) (use Lemma 112.41 (d) and Assumption Al' 
to derive the rate of the variance). Deduce that -R214t(^;&t) = °p(^ -1 ^ 2 )- Next 
consider the d 2 stochastic processes corresponding to the elements of r 2 i^r{b) and 
indexed by d G [c, 

of Bae, Jun and Levental (2010) to deduce that sup 6 ||r2UT 



li , c max ] where b = dbx- For each such process apply Theorem 1 



o p {T~ l l 2 ). Finally, 



deduce that sup 6 ||i?2i4T(^; ^)|| = °p(T 



-1/21 



To handle the term R^{h), let us write 



J t-h 



)u t -h + 5 



t-h 



t=h+l 
T 



t=h+l 

=: R S iT(h) + R 32T (h). 

The term R 32 T{h) could be easily handled taking the norm, using the bound on 5\ 
and Lemma ri2.41 (b) below. For Rzvr{h), we could decompose T^ 1 ^ 2 — S^ 1 ^ 2 in four 
term exactly as we did for R 2 \T{h) and apply the same techniques. The details are 
omitted and are available from the authors upon request. □ 



Lemma 12.4 Le£||-|| denote the Frobenius norm. Under the Assumptions of Propo- 
sition [3] we have: 

(a) As T —>■ oo, for a = 1 or a = — 1, we have 



sup sup 

l<t<T b£B T 
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0\a\ 



O p (l). 



(b) As T -> oo ; 



sup sup 

l<t<T b£B T 



(c) As T — » oo, 



sup 



Op(l). 



ofl). 



t=l 



(dj Is T — )• oo, 



KKT 



max E I sup || £ 4 — E t || 4 



0((1/(TM 2 ) 



12.24) 



The proof of Lemma fl 2 .41 is a direct consequence of Lemmas 7.5 and 7.6 of Patilea 
and Rai'ssi (2010) and Lemma A of Xu and Phillips (2008) applied elementwise, and 
hence will be omitted. 



Proof of Proposition [4] The notation in this proof are those of section [7J First let 
us notice that ?a(^) = #/2{1 + o(1)} for large values of x, provided the asymptotic 
law of a test statistic Qa under the null hypothesis is xtn with some m > 1. In the 
case where a test statistic Qa has the asymptotic distribution of U{5^ l LS ) defined in 
equation (14.2 j) . 

q A ( x ) = - bg P(U(5° LS ) > x) < - log P (max{<5f LS }U 2 > x^j 
= 2ma X r{^} {1 + Q(1)} ' 

and 

q A (x) = - log P(U(5Z LS ) >x)>-\ogP (max{5° LS }i:f^Uj > x) 

■{1 + 0(1)}, 



2max i {5f i5 } 

with U and Uj independent jV(0, 1) variables. Thus to prove (i) it suffices to show 
that 

jf Edr <g> (J^ EdrJ < max{5f L5 } (llldr®l}j E^ 2 (Jj^ dr ® ^ • ( 12 - 25 ) 
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Herein, for any A and B symmetric matrices, A <C B means that B — A is positive 
semidefinite. Now, in the last display, multiply both sides of the order relationship 

/ i \-l/2 ( x \l/2 

on the left and on the right by ( J Tidr ) (g) ( J Q T,dr ) and deduce that it 
suffices to prove 



I d (g)I d < max{5f L5 } 



=: max^^A" 1 . 



1 \l/2 



1 \l/2' 

Y>dr 



o 



1 \V2 

Edr) 



n 



1 \l/2' 

Tjdr 



o 



To obtain (112. 25p it remains to notice that A^ LS = I m <g> A m and that 5? LS , 1 < 
i < d 2 m are the eigenvalues of A,^ iS . 

In (ii) we suppose £(•) = er 2 (-)/ d and in this case it suffices to notice that 

J^a 2 (r)dr 



Y,dr®l}j £-* 2 (Jjldr®!, 



Jo cr 4 (r)dr 



-I d ® 7 d < / d <g> 7 d , 



where for the order relationship we use Cauchy-Schwarz inequality, while 

2 



□ 



12.1 A Taylor expansion of the matrix function f(A) = A l / 2 

Recall that the differential of a function F that maps a r x r matrix X into a r x r 
matrix F(X) is defined by the equation 

vec(dF) = df 

where f is a r 2 x 1 vector function such that f(vec(X)) = vec(F(X)). In other words, 
the (first-order) differential of F at X in the r x r matrix obtained by unstacking 
the differential of df at vec(X). See also Schott (2005), section 9.3. Basic properties 
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of vector differentials implies 



= diX'^XX- 1 / 2 ) = diX-^X 1 / 2 + X- l ' 2 d{X)X~ 1 / 2 + X l ' 2 d {X' 1 ' 2 ) . 



Now, recall that for any A positive definite matrix, the Lyapunov equation AY + 
Y A = B has a unique solution that can be represented as 



See Horn and Johnson (1991), section 6.5. All these facts brings us to the following 
technical result. 

Lemma 12.5 Let A and A be two positive definite rxr— matrices such that < c\ < 
X m i n (B) < oo for some constant c\ and B = A and B = A, where X m i n (B) is the 
smallest eigenvalue of the symmetric matrix B. Moreover, suppose that \\A — A\\ < C2 
for some small constant Ci- Then 



Jo 

where R n is ar x r— symmetric matrix with \\R n \\ < C 1 1 -^4 — A\\ 2 and C is a constant 
depending only on c\, Ci- 

Proof of Lemma 112.51 Let A be some arbitrary matrix. By Taylor expansion, for 
sufficiently small values of e and for some matrices Gi, i = 1, 2, ... 



(A + sA)~ 1/2 = A~ l/2 + eG x + e 2 G 2 + ... = A~ 1/2 + eG x + R x (12.26) 



with H-Rill < C\E 2 and C\ a constant depending on c\ and the norm of A. This kind 
of representation could be derived from a Taylor formula for the vector function 
f defined by the equation f(vec{X)) = vec{X~ 1 ^ 2 ) considered in a neighborhood 
of the vec(A) for a positive definite matrix A. See, for instance, Schott (2005), 



Y 





n 
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section 9.6. On the other hand, recall that for B a square matrix with \\B\\ < 1, 
(I — B)~ l = I + B + B 2 + ... = I + B + R2 where R2 is the reminder of the expansion 
with H-R2II < ll-S|| 2 (l — ll-Bll)" 1 - Thus for sufficiently small values of we can write 

(A + eAy 1 = A~ l ' 2 (I + eA~ l / 2 AA~ l ' 2 y l A~ l l 2 (12.27) 
= A~ 1/2 (I - eA-^AA-V 2 + e 2 A-y 2 AA- 1 AA- 1 ^ + ...)A~ l l 2 

Taking the square on both sides of the first equality in (112. 26p and identifying the 
coefficients of the power of e in equation (112. 27p deduce that G± is the solution of 
the Lyapunov equation 

A- 1 ' 2 Y + YA- 1 ' 2 = -A~ 1 AA- 1 . 
Finally, the result follows by taking A = (A - A)/\\A - A\\ and e = \\A - A\\. □ 
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13 Appendix B: Tables and Figures 

Table 1: Empirical size (in %) of the portmanteau tests with iid standard Gaussian errors. 
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Figure 1: The asymptotic variance Egls(2, 2) on the left and the ratio Eols(6, 6)/Sq LS (6, 6) 
on the right for t\ = ti in Example 13.11 
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Table 2: Empirical size (in %) of the portmanteau tests. The innovations are heteroscedas- 
tic with an abrupt break at T/2. 



base 


m = 5 


m — 15 


T 


50 


100 


200 


50 


100 


200 


LB s m 


27.9 


35.3 


40.1 


35.7 


63.0 


76.7 


LB OLS 


4.5 


3.3 


4.8 


5.4 


6.1 


6.0 


LB ALS 


3.2 


3.7 


5.0 


3.8 


3.8 


3.9 


LB CLS 


2.6 


4.2 


5.7 


3.7 


4.2 


4.7 


OLS 

LB m 


28.9 


13.8 


9.7 


30.9 


21.9 


15.2 


ALS 


18.6 


8.7 


7.1 


35.0 


15.9 


9.0 


GLS 

LB m 


6.4 


5.3 


6.3 


12.5 


10.0 


9.7 



Table 3: Empirical size (in %) of the portmanteau tests. The innovations are heteroscedas- 
tic with trending behaviour. 
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5.7 


5.3 


6.3 


10.8 


9.7 


9.2 
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Table 4: The empirical means and standard deviations of the weights in the sums 
(14. 2p . (15. 4p and their GLS counterparts over the N = 1000 iterations. The innova- 
tions are heteroscedastic with trending behaviour. 



i 


i 


2 


3 


4 


5 


$ols 


0.02[ .o2] 


0.06[o.o4] 


0-11(0.06] 


0.2[o.o9] 


0-89(0.11] 


£als 


0-03[ .o2] 


0.05[o.o2] 


0-09(0.03] 


0-11(0.03] 


l-00[o.oo] 




0.05[o.o4] 


0.06[o.o4] 


0-15(0.08] 


0-17(0.08] 


l-00[o.oo] 


i 


6 


7 


8 


9 


10 




0.89 [0 .ii] 


0-89[o.ii] 


0.91[o.ii] 


1-11 [0.13] 


1-11 [0.13] 


ials 


l-00[ .oo] 


l-00[o.oo] 


1.00[o.oo] 


l-00[o.oo] 


l-00[o.oo] 




l-00[o.oo] 


l-00[o.oo] 


1.00[o.oo] 


l-00[o.oo] 


l-00[o.oo] 


i 


11 


12 


13 


14 


15 


gols 


1-H[0.13] 


1-12(0.13] 


1.36[o.i7] 


l-36[o.i7] 


l-36[o.i7] 


ials 
u i 


l-00[ .oo] 


l-00[o.oo] 


1.00[o.oo] 


l-00[o.oo] 


l-00[o.oo] 




l-00[ .oo] 


l-00[o.oo] 


l.OOjo.oo] 


l-00[o.oo] 


l-00[o.oo] 


i 


16 


17 


18 


19 


20 


Sols 


l-36[ .i7] 


1-69(0.27] 


1-71(0.27] 


1-71(0.27] 


1-71 [0.27] 


ials 


l-00[ .oo] 


l-00[o.oo] 


1.00[o.oo] 


l-00[o.oo] 


l-00[o.oo] 




l-00[ .oo] 


l-00[o.oo] 


l-00[o.oo] 


l-00[o.oo] 


l-00[o.oo] 



Table 5: The estimators of the autoregressive parameters of the VAR(l) model for the 
balance data for the U.S.. 



Parameter 


0i 


2 


03 


04 


ALS estimate 


0-33(0.08] 


0-02[o.o2] 


-0.35[o.3o] 


-0.07[o.o8] 


OLS estimate 


0-45(0.23] 


0-00(0.02] 


-1-02(0.60] 


0-1(0.17] 
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Table 6: The balance data for the U.S.: the p-values of the ARCH-LM tests (in %) for 
the components of the ALS-residuals of a VAR(l). 



lags 


2 


5 


10 


ht 


22.26 


45.05 


36.44 




25.32 


73.32 


77.18 



Table 7: The p-values of the portmanteau tests (in %) for the checking of the adequacy 
of the VAR(l) model for the U.S. trade balance data. 



m 


5 


15 


LB s m 


0.00 


0.01 


lb ols 


50.80 


99.94 


LB ALS 


6.36 


15.95 


OLS 

LB m 


0.00 


7.87 


ALS 


5.61 


15.50 



Table 8: The balance data for the U.S.: the test statistics of the portmanteau tests used 

~OLS ~ ALS 

for checking the adequacy of the VAR(l) model. The Q m and Q m correspond to the 
statistics of the LB version of the Katayama portmanteau tests with standard asymptotic 
distribution. 



m 


5 


15 


Q° m LS 


6.84 


106.34 


f)ALS 


25.73 


66.83 


Q OLS 

—m 


6.84 


106.34 


Q ALS 

—m 


48.73 


66.83 
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Table 9: The estimators of the autoregressive parameters of the VAR(4) model for the 
U.S. energy-transportation price indexes. 



Parameter 


#1 


# 2 


#3 


# 4 


ALS estimate 
OLS estimate 


0.36[o.o8] 

U- '4[o.32] 


0.37[o.i4] 

1 no 

J--US[0.67] 


0-10(0.04] 

-0.08[o.i3] 


0-43(0.08] 

n in 
U.lU[o.28] 


Parameter 


#5 


# 6 


# 7 




ALS estimate 
OLS estimate 


0.06[o.o8] 

-U.OCJ[ .35] 


—0.02[o.i4] 

-J-.^D[0.73] 


-0.09(0.04] 

n in 
0-10(0.14] 


-0.13[o.o8] 

U.^ ' [0.30] 


Parameter 




#10 




#12 


ALS estimate 
OLS estimate 


0.18[o.o8] 

0-21(0.24] 


0.13[o.i4] 

0-13(0.52] 


-0.05(0.04] 
-0.05[o.i4] 


0-01(0.08] 

0.03[o.3i] 


Parameter 


#13 


#14 


#15 


#16 


ALS estimate 
OLS estimate 


0-17(0.08] 

0-32(0.26] 


0.15[o.i4] 
0.64[o. 57] 


—0.07[o.o4] 
-0.17[o.i5] 


0.03[o.o8] 

0-31(0.32] 



Table 10: The p-values of the portmanteau tests (in %) for the checking of the adequacy 
of the VAR(4) model for the U.S. energy-transportation price indexes (n.a.: not available). 



m 


3 


6 


12 


LB s m 


n.a. 


2.08 


0.00 


lb ols 


100.00 


100.00 


100.00 


LB ALS 


85.44 


98.72 


10.07 


OLS 

LB m 


n.a. 


n.a. 


n.a. 


• ALS 

LB m 


n.a. 


n.a. 


n.a. 
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Table 11: VAR modeling of the energy-transportation price indexes: the test statistics of 
the portmanteau tests used for checking the adequacy of the VAR(4) model. 



m 


3 


6 


12 


Q?n LS 


1.87 


18.06 


117.51 


Qi LS 


5.03 


9.50 


57.69 


Q OLS 

—m 


n.a. 


n.a. 


n.a. 


Q ALS 

—m 


n.a. 


n.a. 


n.a. 



Table 12: VAR modeling of the energy-transportation price indexes: the weights of the 
non standard distributions of the portmanteau tests used for checking the adequacy of the 
VAR(4) model with m = 3. 



i 


l 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


gols 


0.05 


0.48 


1.94 


2.05 


2.66 


2.85 


4.97 


6.58 


10.51 


16.06 


64.14 


312.18 


xals 


0.01 


0.09 


0.24 


0.89 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 




Fi gure 2: The same as in Figure Q] but for t\ ^ T2 in general. 
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50 100 200 300 50 100 200 300 50 100 200 300 



Figure 3: Empirical power (in %) of the portmanteau tests with m = 10. The adequacy of a 
VAR(l) model to VAR(2) processes is tested. The innovations are homoscedastic on the right. 
The variance exhibits a break at T/2 on the left and have a trending behavior in the middle. 




50 100 150 200 50 100 150 200 



Figure 4: Empirical power (in %) of the portmanteau tests with m = 10. The non correlation 
of VAR(l) processes is tested. The variance have a trending behavior on the left and exhibits an 
abrupt shift on the right. 
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" 1975.3 1980.3 1985.3 1990.3 1995.3 2000.3 2005.3 1 1975.3 1980.3 1985.3 1990.3 1995.3 2000.3 2005.3 

Figure 5: The balance on merchandise trade for the U.S. on the left and the balance on services 
for the U.S. on the right in billions of dollars from 1/1/1970 to 10/1/2009, T=160. Data source: 
The research division of the federal reserve bank of Saint Louis, www.research.stlouis.org. 




Figure 6: The differences of the balance on merchandise trade (on the left) and of the balance 
on services for the U.S. (on the right). 
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Figure 7: The cross validation score (CV) for the ALS estimation of the VAR(l) model for the 
differences of the balance on merchandise trade and on services in the U.S.. 
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Figure 8: The ALS residuals of a VAR(l) for the differences of the balance on merchandise trade 
and on services in the U.S.. The first component of the ALS residuals is on the left and the second 
is on the right. 
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Figure 9: The same as in Figure [8] but for the OLS residuals. 
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Figure 10: The balance data for the U.S.: the autocorrelations of the squares of the first compo- 
nent of the ALS residuals (on the left) and of the second component of the ALS residual (on the 
right) . 
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Figure 11: The balance data for the U.S.: the logarithms of the uf t 's (full line) and the logarithms 
of the non parametric estimates of Var(wi t ) (dotted line) on the left and the same for the W2t' s an d 
Var(w,2t) on the right. 
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Figure 12: The balance data for the U.S.: estimation of the correlation between of the components 
of the error process. 
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Figure 13: The balance data for the U.S.: the ALS residual autocorrelations -R^lsC 1 ) ( on the 
left) and R\\s{h) (on the right), with obvious notations. The 95% confidence bounds are obtained 
using ([377]) and (|5~3| . 
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Figure 14: The same as in Figure IT31 but for R 2 A LS (h) (on the left) and R A 2 LS (h) (on the right) 




Figure 15: The balance data for the U.S.: the OLS residual autocorrelations Ro^g{h) (on the 
left) and RoLg(h) (on the right). The full lines 95% confidence bounds are obtained using (j3.13[) . 
The dotted lines 95% confidence bounds are obtained using the standard result (|3.9|) . 




Figure 16: The same as in Figure [T3] but for #o LS (/i) ( on tne l ert ) ancl RqlsW ( on tne right). 
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Figure 17: The energy price index (full line) and the transportation price index (dotted line) in 
the U.S. from 1/1/1957 to 2/1/2011, T=648. Data source: The research division of the federal 
reserve bank of Saint Louis, www.research.stlouis.org. 
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1958.01 1964.09 1971.05 1978.01 1984.09 1991.05 1998.01 2004.09 1956.01 1964.09 1971.05 1978.01 1984.09 1991.05 1998.01 2004.09 

Figure 18: The differences of the energy price (on the left) and of the transportation price indexes 



for the U.S. (on the right). 




Figure 19: The cross validation score (CV) for the ALS estimation of the VAR(4) model for the 
differences of the energy-transportation price indexes in the U.S.. 
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Figure 20: The ALS residuals of a VAR(4) model for the differences of the energy and trans- 
portation price indexes for the U.S.. The first component of the ALS residuals is on the left and 
the second is on the right. 
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Figure 21: The same as in Figure[5]but for the OLS residuals. 
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Figure 22: The energy-transportation data for the U.S.: the logarithms of the u lt 's (full line) 
and the logarithms of the non parametric estimation of Var(ui t ) (dotted line) on the left and the 
same for the w| t 's and Var(?i2t) on the right. 
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Figure 23: The energy-transportation data for the U.S.: estimation of the correlation between 
the components of the error process. 
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Figure 24: The energy-transportation data for the U.S.: the ALS residual autocorrelations 
R^Lsih) (on the left) and R^lsO 1 ) ( on the right), with obvious notations. The 95% confidence 
bounds are obtained using ()3.7p and (|5.3p . 
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Figure 25: The same as in Figure [24] but for R A \ s {h) ( on tne l ert ) anc ^ RalsW ( on tne right). 
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Figure 26: The energy-transportation data for the U.S.: the OLS residual autocorrelations 
RolsO 1 ) ( on the left) and RolsQ 1 ) ( on t ne right). The full line 95% confidence bounds are 
obtained using (|3.13|) . The dotted lines 95% confidence bounds are obtained using the standard 
result 43U). 
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Fi gure 28: The energy-transportation data for the U.S.: the OLS residual autocorrelations 
RoLsiti) (on the left) and R^ L g{h) (on the right). The dotted lines 95% confidence bounds are 
obtained using the standard result (|3.9[) . 
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Figure 29: The same as in Figure [28] but for R 2 Q LS {h) (on the left) and R}y LS (h) (on the right). 
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